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[Designation of Document] Specification 
[Title of the Invention] Novel Aggrecanase 
[Claim] 

[Claim 1] A me tallopro tease which comprises an 
5 amino acid sequence of from the 213th position to the 583rd 
position of an amino acid sequence represented by SEQ ID 
N0:1, or a metalloprotease which is an equivalent of said 
me tallopro tease . 

[Claim 2] A metalloprotease which comprises an 

10 amino acid sequence of from the 1st position to the 583rd 
position of an amino acid sequence represented by SEQ ID 
NO:l, or a metalloprotease which is an equivalent of said 
metalloprotease . 

[Claim 3] A metalloprotease having an aggrecanase 

15 activity, which consists of an amino acid sequence 

represented by SEQ ID NO:l, an amino acid sequence of from 
the 1st position to the 583rd position of the amino acid 
sequence represented by SEQ ID NO : 1 , an amino acid sequence 
of from the 213th position to the 950th position of the 

20 amino acid sequence represented by SEQ ID NO:l, or an amino 
acid sequence of from the 213th position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
NO:l, or a metalloprotease which is an equivalent of said 
metalloprotease . 

25 [Claim 4] A metalloprotease as described in any one 

of claims 1 to 3, which has an aggrecanase activity. 



[Claim 5] A method for screening a substance 
capable of modifying the aggrecanase activity of a 
metalloprotease , which comprises allowing the 
metalloprotease as described in claim 4 to contact with a 
5 compound to be tested. 

[Claim 6] A gene which encodes an amino acid 
sequence of the metalloprotease as described in any one of 
claims 1 to 3. 

[Claim 7] A vector which comprises the gene 
10 described in claim 6. 

[Claim 8] A host cell which comprises the vector 
described in claim 7 . 

[Claim 9] A method for producing the 
metalloprotease described in any one of claims 1 to 3, 
15 which comprises using the host cell described in claim 8. 

[Claim 10] An antibody against the metalloprotease 
described in any one of claims 1 to 3, 

[Claim 11] A method for screening a substance 
capable of modifying the aggrecanase activity of a 
20 metalloprotease, which comprises allowing the 

metalloprotease as described in any one of claims 1 to 3 to 
contact with a compound to be tested. 

[Claim 12] An agent for inhibiting proteoglycan 
degradation, which comprises as an active ingredient a 
25 substance which inhibits metalloprotease described in claim 
4. 



[Claim 13] A gene represented by SEQ ID NO: 24 , 25, 
26, 27, 28, 29, 30 or 31, or a gene which is an equivalent 
of said gene. 

[Detailed Description of the Invention] 
5 [0001] 

[Technical Field to which the Invention Belongs] 

This invention relates to a novel metallopro tease, a 
gene coding for the metalloprotease, a method for producing 
the metalloprotease, a method for screening a substance 
10 capable of modifying the aggrecanase activity using the 
metalloprotease, an agent for inhibiting proteoglycan 
degradation, which comprises as an active ingredient a 
substance which inhibits a metalloprotease having an 
aggrecanase activity, and a promoter gene of the 
1 5 metalloprotease . 

[0002] 
[Prior Art] 

Joint diseases are diseases which show damage and 
degeneration of joint cartilage as the main morbid states. 

20 Though a disease having the most frequent number of 
patients among joint diseases is osteoarthritis (OA) , 
analgesic anti-inflammatory drugs and hyaluronic acid 
preparations are used in the current therapeutic method 
merely as a symptomatic therapy for the purpose of 

25 alleviating pains accompanied by the degeneration of 

cartilage and the destruction of bone under cartilage, so 
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that it cannot be said that they are exerting sufficient 
therapeutic effects . 

Joint cartilage is a tissue mainly composed of type 
II collagen and aggrecan which is a cartilage- specific 
5 proteoglycan, and degradation and degeneration of both of 
them are observed in the joint diseases. Because of this, 
it has been considered for a long time that control of the 
degradation and degeneration of these extracellular matrix 
components would lead to the treatment of joint diseases, 

10 so that attempts have been positively made to identify 
degradation-concerned proteases (collagenase and 
aggrecanase) and to screen their inhibitors and develop 
them as medicaments. 

As proteases having collagenase activities , matrix 

15 me tallopro teases (MMPl , MMP8 , MMP13, MMP14 and the like) 
have been identified, and their selective inhibitors have 
been discovered. However, in spite of the attempts to 
develop a large number of MMP inhibitors having collagenase 
inhibition activities as therapeutic drugs for joint 

20 diseases including OA and rheumatic arthritis (RA) , MMP 

inhibitors to be used in these diseases as the indication 
have not been put on the market. Under such circumstances, 
attention has been directed toward aggrecanase which 
selectively degrades aggrecan which is another main 

25 constituting component of joint cartilage. 
[0003] 



A joint disease-related role of an enzyme aggrecanase 
which cleaves aggrecan at the site between Glu373-Ala374 
has been revealed by the reports of Sandy et al. and 
Lohmander et al. stating that all of the main digested 
5 aggrecan fragments found in the synovial fluid of human 
arthritis patients were generated by cleaving at the 
aggrecanase digestion site (Sandy J.D. et al. , J. Clin. 
Invest., 89, 1512 - 1516, 1992; Lohmander L.S. et al. , 
Arthritis Rheum., 36, 1214 - 1222, 1993). On the other 

10 hand, it has been known that, in an in vitro explant 

culture system of joint cartilage, degradation of aggrecan 
firstly occurs by IL-1 induction and then degradation of 
type II collagen is accelerated (Dingle L.T. et al. , Ann. 
Rheum. Dis. , 34, 303 - 311, 1975; Cawston T.E. et al. , 

15 Biochem. Biophys. Res. Comm., 215, 377 - 385, 1995; Kozaci 
L.D. et al., Arthritis Rheum. , 40, 164 - 174, 1997). It 
has been reported that the aggrecan degradation takes the 
precedence of the type II collagen degradation in a mouse 
arthritis model too (van Meurs J.B. et al. , Arthritis 

20 Rheum., 42, 1128 - 1139, 1999). These reports suggest a 
possibility that the type II collagen degradation can be 
controlled by inhibiting the preceding aggrecan 
degradation . 
[0004] 

25 However, the entity of the aggrecanase has been 

unclear for long time, though its biochemical properties 



had been elucidated , namely it is a me tallopro tease, it 
exists in outside of cells, a glycosaminoglycan side chain 
is concerned in its substrate recognition, its activity is 
induced by IL-1, TNF and retinoic acid, and the like. 
5 Recently, ADAMTS4 (aggrecanase-1 : Tortorella M.D. et al. , 
Science, 284, 1664 - 1666, 1999) and ADAMTS11 (aggrecanase- 
2: Abbaszade I. et al. , J. Biol. Chem. , 274, 23443 - 23450, 
1999) have been reported as proteases having an aggrecanase 
activity. However, because their gene expression in human 

10 OA cartilage is not increased, and their gene expression in 
an in vitro explant culture system of human knee joint 
cartilage is not induced by IL-1, TNF and retinoic acid 
which induce the aggrecanase activity that causes joint 
diseases, existence of another protease relating to joint 

15 deseases was suggested. (Flannery C.R. et al., Blochem. 
Biophys. Res. Common. , 260, 318 - 322, 1999). 
[0005] 

[Problems that the Invention is to Solve] 
The objects of the invention are to provide 
20 recombinant proteins by isolating and identifying genes 
coding for aggrecanases which are concerned in the onset 
and progress of joint diseases, particularly osteoarthritis 
as a disease having the most frequent number of patients 
and thereby constructing their expression production 
25 systems, to enable screening of a substance capable of 
modifying the aggrecanase activity by constructing a 



screening system of the substance, and to show proteoglycan 
degradation inhibition action of the substance. 
[0006] 

[Means for Solving the Problems] 
5 Under such circumstances, the present inventors have 

conducted intensive studies and succeeded as a result in 
isolating a gene coding for a novel protein classified into 
the ADAMTS family, determining its complete ORF sequence 
and thereby enabling production of a recombinant protein. 

10 Also, a vector containing this gene, a host cell 

containing this vector and a method for producing the novel 
protein using this host cell were established. Based on 
these, screening of compounds, peptides and antibodies 
capable of modifying protease activity of the protein was 

15 enabled and the invention was accomplished. 

In addition, it was found that this protein has the 
activity to cut the extracellular substrate aggrecan 
selectively between Glu373-Ala374, namely an aggrecanase 
activity as one of the protease activities, which suggested 

20 a possibility of the use of a compound capable of 

significantly modifying this protein and activity of the 
protein as a medicament. 

Thereafter, the invention has been accomplished by 
finding a compound which significantly inhibits aggrecanase 

25 activity of this protein and showing that this compound has 
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an action to inhibit degradation of proteoglycans in a 
system which uses joint cartilage primary culture cells. 
[0007] 

Accordingly, the invention relates to 
5 (1) a me tallopro tease which comprises an amino acid 

sequence of from the 213th position to the 583rd position 
of an amino acid sequence represented by SEQ ID NO : 1 , or a 
metalloprotease which is an equivalent of said 
metalloprotease . 

10 (2) a metalloprotease which comprises an amino acid 

sequence of from the 1st position to the 583rd position of 
an amino acid sequence represented by SEQ ID NO : 1 , or a 
metalloprotease which is an equivalent of said 
metalloprotease . 

15 (3) a metalloprotease having an aggrecanase activity, 

which consists of an amino acid sequence represented by SEQ 
ID NO:l, an amino acid sequence of from the 1st position to 
the 583rd position of the amino acid sequence represented 
by SEQ ID NO:l, an amino acid sequence of from the 213th 

20 position to the 950th position of the amino acid sequence 
represented by SEQ ID NO:l, or an amino acid sequence of 
from the 213th position to the 583rd position of the amino 
acid sequence represented by SEQ ID NO:l, or a 
metalloprotease which is an equivalent of said 

2 5 metalloprotease . 
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(4) a metalloprotease as described in any one of (1) 
to (3) , which has an aggrecanase activity. 

(5) a method for screening a substance capable of 
modifying the aggrecanase activity of a metalloprotease, 

5 which comprises allowing the metalloprotease as described 
in (4) to contact with a compound to be tested. 

(6) a gene which encodes an amino acid sequence of 
the metalloprotease as described in any one of (1) to (3) . 

(7) a vector which comprises the gene described in 

10 (6). 

(8) a host cell which comprises the vector described 
in (7) . 

(9) a method for producing the metalloprotease 
described in any one of (1) to (3) , which comprises using 

15 the host cell described in (8) . 

(10) an antibody against the metalloprotease 
described in any one of (1) to (3) , 

(11) a method for screening a substance capable of 
modifying the aggrecanase activity of a metalloprotease, 

20 which comprises allowing the metalloprotease as described 
in any one of (1) to (3) to contact with a compound to be 
tested. 

(12) an agent for inhibiting proteoglycan 
degradation, which comprises as an active ingredient a 

25 substance which inhibits metalloprotease described (4) . 
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(13) a gene represented by SEQ ID NO:24, 25, 26, 27 , 
28, 29, 30 or 31, or a gene which is an equivalent of said 
gene . 

[0008] 

5 [Mode for Carrying Out the Invention] 

The following describes the terms used in the 
invention . 

The "metalloprotease" as used herein means a 
"metal lopro tease " which has a zinc coordination consensus 
10 sequence (HExxH) and has a protease activity. Also, unless 
otherwise noted, the "protease" is referred to as 
"protein" . 

The novel metal lopro tease of the invention is any of 
a metalloprotease which contains an amino acid sequence of 

15 from the 213th position to the 583rd position of the amino 
acid sequence represented by SEQ ID N0:1 and has a protein 
degradation ability, or a metalloprotease which is an 
equivalent of the metalloprotease. 

Also, the novel metalloprotease of the invention any 

20 of a metalloprotease which contains an amino acid sequence 
of from the 1st position to the 583rd position of the amino 
acid sequence represented by SEQ ID NO:l and has a protein 
degradation ability, or a metalloprotease which is an 
equivalent of the metalloprotease. 

25 In addition, it is any of a metalloprotease which has 

the amino acid sequence represented by SEQ ID NO:l, an 

10 



amino acid sequence of from the 1st position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
N0:1, an amino acid sequence of from the 213th position to 
the 950th position of the amino acid sequence represented 
5 by SEQ ID N0:1 or an amino acid sequence of from the 213th 
position to the 583rd position of the amino acid sequence 
represented by SEQ ID NO:l and has a protein degradation 
ability, or a metalloprotease which is an equivalent of 
this metalloprotease. 

10 Regarding the "equivalent of the metalloprotease", 

(1) in the case of an equivalent of the metalloprotease 
containing an amino acid sequence of from the 213th 
position to the 583rd position of the amino acid sequence 
represented by SEQ ID NO : 1 and having a protein degradation 

15 ability, it is a metalloprotease in which one to several 
amino acid residues may be substituted, deleted and/or 
inserted at one to several positions in the amino acid 
sequence of from the 213th position to the 583rd position, 
and which has the protein degradation ability, (2) in the 

20 case of an equivalent of the metalloprotease containing an 
amino acid sequence of from the 1st position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
NO:l and having a protein degradation ability, it is a 
metalloprotease in which one to several amino acid residues 

25 may be substituted, deleted and/or inserted at one to 

several positions in the amino acid sequence of from the 
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1st position to the 583rd position, and which has the 
protein degradation ability , or (3) in the case of an 
equivalent of the metalloprotease containing the amino acid 
sequence represented by SEQ ID NO : 1 , an amino acid sequence 
5 of from the 1st position to the 583rd position of the amino 
acid sequence represented by SEQ ID NO : 1 , an amino acid 
sequence of from the 213th position to the 950th position 
of the amino acid sequence represented by SEQ ID NO:l or an 
amino acid sequence of from the 213th position to the 583rd 

10 position of the amino acid sequence represented by SEQ ID 
NO:l and having a protein degradation ability, it is a 
metalloprotease in which one to several amino acid residues 
may be substituted, deleted and/or inserted at one to 
several positions in respective sequences , and which has 

15 the protein degradation ability. Preferred as the 

"equivalent of the metalloprotease" is the metalloprotease 
formed by amino acid substitution such as SNP. 

Preferred as the novel metalloprotease of the 
invention are a polypeptide having the amino acid sequence 

20 represented by SEQ ID NO : 1 , a polypeptide having an amino 
acid sequence of from the 1st position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
NO:l, a polypeptide having an amino acid sequence of from 
the 213th position to the 950th position of the amino acid 

25 sequence represented by SEQ ID NO:l and a polypeptide 

having an amino acid sequence of from the 213th position to 
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the 583rd position of the amino acid sequence represented 
by SEQ ID N0:1, particularly a polypeptide having an amino 
acid sequence of from the 213th position to the 950th 
position of the amino acid sequence represented by SEQ ID 
5 NO: 1 and a polypeptide having an amino acid sequence of 

from the 213th position to the 583rd position of the amino 
acid sequence represented by SEQ ID NO : 1 . 
[0009] 

The gene coding for the novel metalloprotease of the 
10 invention is any gene which contains a nucleotide sequence 
coding for the metalloprotease, namely a metalloprotease 
gene which contains a nucleotide sequence coding for an 
amino acid sequence of from the 213th position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
15 NO :1 or an equivalent of the metalloprotease and which has 
a protein degradation ability. 

Also, the gene coding for the novel metalloprotein of 
the invention may be any metalloprotease gene which 
contains a nucleotide sequence coding for a metalloprotease 
20 represented by an amino acid sequence of from the 1st 

position to the 583rd position of the amino acid sequence 
shown in SEQ ID NO:l or for an equivalent of the 
metalloprotease and which has a protein degradation 
ability. 

25 In addition , it may be any metalloprotease gene which 

has a nucleotide sequence coding for a metalloprotease 
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represented by the amino acid sequence shown in SEQ ID 
N0:1, or by an amino acid sequence of from the 1st position 
to the 583rd position of the amino acid sequence shown in 
SEQ ID NO : 1 , an amino acid sequence of from the 213th 
5 position to the 950th position of the amino acid sequence 
shown in SEQ ID NO:l or an amino acid sequence of from the 
213th position to the 583rd position of the amino acid 
sequence shown in SEQ ID NO : 1 , or for an equivalent of the 
metalloprotease, and which has a protein degradation 
10 ability. 

Regarding the "nucleotide sequence coding for an 
equivalent of the me tallopro tease" , (1) in the case of a 
nucleotide sequence coding for an equivalent of the 
metalloprotease containing an amino acid sequence of from 

15 the 213th position to the 583rd position of the amino acid 
sequence represented by SEQ ID NO:l and having a protein 
degradation ability, it is a nucleotide sequence coding for 
a metalloprotease in which one to several amino acid 
residues may be substituted, deleted and/or inserted at one 

20 to several positions in the amino acid sequence of from the 
213th position to the 583rd position, and which has the 
protein degradation ability, (2) in the case of a 
nucleotide sequence coding for an equivalent of the 
metalloprotease containing an amino acid sequence of from 

25 the 1st position to the 583rd position of the amino acid 
sequence represented by SEQ ID N0:1 and having a protein 

14 



degradation ability, it is a nucleotide sequence coding for 
a metalloprotease in which one to several amino acid 
residues may be substituted, deleted and/or inserted at one 
to several positions in the amino acid sequence of from the 
5 1st position to the 583rd position, and which has the 
protein degradation ability, or (3) in the case of a 
nucleotide sequence coding for an equivalent of the 
metalloprotease containing the amino acid sequence 
represented by SEQ ID N0:1, an amino acid sequence of from 

10 the 1st position to the 583rd position of the amino acid 

sequence represented by SEQ ID NO:l, an amino acid sequence 
of from the 213th position to the 950th position of the 
amino acid sequence represented by SEQ ID NO:l or an amino 
acid sequence of from the 213th position to the 583rd 

15 position of the amino acid sequence represented by SEQ ID 
NO:l and having a protein degradation ability, it is a 
nucleotide sequence coding for a metalloprotease in which 
one to several amino acid residues may be substituted, 
deleted and/or inserted at one to several positions in 

20 respective sequences, and which has the protein degradation 
ability. Preferred as the "nucleotide sequence coding for 
an equivalent of the metalloprotease" is a nucleotide 
sequence coding for the metalloprotease formed by amino 
acid substitution such as SNP. 

25 Regarding the gene coding for the novel 

metalloprotease of the invention, it is preferably a gene 

15 



having from the 1st position to the 1749th position, from 
the 1st position to the 2850th position, from the 637th 
position to the 1749th position or from the 637th position 
to the 2850th position, of the nucleotide sequence 
5 described in SEQ ID NO: 2 , more preferably a gene having 

from the 637th position to the 1749th position or from the 
637th position to the 2850th position, of the nucleotide 
sequence described in SEQ ID NO: 2 . 

A promoter gene is included in the invention, and the 
10 promoter gene of the invention is preferably a gene having 
a nucleotide sequence described in SEQ ID NO: 24, 25, 26, 

27, 28, 29, 30 or 31. In the "gene as an equivalent of the 
gene described in SEQ ID NO:24, 25, 26, 27, 28, 29, 30 or 
31", one to several bases may be substituted, deleted 

15 and/or inserted at one to several positions in the 

nucleotide sequence described in SEQ ID NO: 24, 25, 26, 27, 

28, 29, 30 or 31. 

The novel metalloprotease of the invention has 
various protease activities, and it is a particularly 
20 noteworthy point that it has an aggrecanase activity as one 
of these activities . 

Also, the metalloprotease of the invention having the 
aggrecanase activity can be used in the screening of 
substances capable of modifying the aggrecanase activity. 
25 Among substances capable of modifying the metalloprotease 

having aggrecanase activity, a substance which inhibits the 

16 



aggrecanase activity is useful as a proteoglycan 
degradation inhibitor. In addition, the promoter gene of 
the metalloprotease of the invention exists in two or more 
variant forms. Accordingly, it is a noteworthy point that 
5 a substance capable of modifying the promoter activity can 
be screened using a desired variant of the promoter gene in 
response to each purpose. 
[0010] 

Now, a gene coding for the novel protein of the 
10 invention, a vector of the invention, a host cell of the 

invention, a method for producing the novel protein of the 
invention, a method for detecting activity of the novel 
protein of the invention, a method for producing an 
antibody which reacts with the novel protein of the 
15 invention, a method for screening a substance capable of 

modifying activity of the novel protein of the invention, a 
method for detecting promoter activity and a method for 
screening a substance capable of modifying the promoter 
activity are described in the following 1) to 9) . All of 
20 the items described in 1) to 9) are included in the 
invention . 

[0011] 

1) Production method of protein gene 

a) First production method - a method which uses PGR 
25 A mRNA sample is extracted from a human cell or 

tissue having the ability to produce the novel protein of 

17 



the invention. Next, using this mRNA as the template, two 
primers interposing the mRNA or a part of mRNA of the novel 
protein are prepared. Full-length cDNA or a part thereof 
corresponding to the novel protein can be obtained by 
5 modifying denature temperature, denaturing agent adding 
condition and the like and carrying out a reverse 
transcriptase-polymerase chain reaction (to be referred to 
as RT-PCR hereinafter) suited for a respective protein 
comprising a part of the amino acid sequence represented by 

10 SEQ ID NO:l of the invention. Alternatively, full-length 
cDNA or a part thereof corresponding to the novel protein 
can be obtained by carrying out a polymerase chain reaction 
(to be referred to as RT-PCR hereinafter) , by using cDNA 
prepared using reverse transcriptase from mRNA which is 

15 extracted from a human cell or tissue having the ability to 
produce the novel protein of the invention, or a 
commercially available cDNA preparation derived from a 
human cell or tissue, as the template. Thereafter, the 
novel protein can be produced by integrating the thus 

20 obtained full-length cDNA or a part thereof corresponding 
to the novel protein into an appropriate expression vector 
and expressing it in a host cell. 

Firstly, mRNA comprising a sequence coding for the 
protease is extracted from a human cell or tissue having 

25 the ability to produce the novel protein of the invention 
by a known method. As the extraction method, a guanidine 
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thiocyanate hot phenol method, a guanidine thiocyanate- 
guanidine hydrochloride method and the like can be 
exemplified, but a guanidine thiocyanate cesium chloride 
method can be preferably cited. The cell or tissue having 
5 the ability to produce this protease can be specified by, 
e.g., northern blot technique using a gene or a part 
thereof having a nucleotide sequence coding for the 
protease or western blot technique using an antibody 
specific for the protease. 

10 Purification of mRNA can be carried out in accordance 

with a usual method, for example, it can be purified by 
binding it to an oligo (dT) cellulose column and then 
eluting it. Alternatively, a commercially available 
extracted and purified mRNA may be used without extracting 

15 the mRNA. 

Subsequently, single-strand cDNA is synthesized by 
carrying out reverse transcriptase reaction of the purified 
mRNA in the presence of random primers, oligo (dT) primers 
or custom-synthesized primers . Using two primers 

20 interposing a part of the gene of interest, the thus 

obtained single-strand cDNA is subjected to PCR to amplify 
the novel protein DNA of interest. Alternatively, a 
commercially available cDNA preparation may be used without 
synthesizing the cDNA. The thus obtained DNA is 

25 fractionated by a means such as agarose gel electrophoresis 
or the like. If desired, a DNA fragment of interest can be 



obtained by digesting this DNA with restriction enzymes and 
the like and then ligating the digested fragments. 
[0012] 

b) Second production method 
5 In addition to the above production method, the gene 

of the invention can be produced using conventional genetic 
engineering techniques. Firstly , single-strand cDNA is 
synthesized using reverse transcriptase and using the mRNA 
obtained by the above method as the template and then 

10 double-strand cDNA is synthesized from this single-strand 
cDNA. As the method , the SI nuclease method (Ef stratiadis , 
A. et al., Cell, 7, 279 - 288, 1976), Land method (Land, H. 
et al., Nucleic Acids Res., 9, 2251 - 2266, 1981), O. Joon 
Yoo method (Yoo, O.J. et al., Proc. Natl. Acad. Sci. USA, 

15 79, 1049 - 1053, 1983), the Okayama-Berg method (Okayama, 
H. and Berg, P., Mol. Cell. Biol., 2, 161 - 170, 1982) and 
the like can be exemplified. 

Next, an Escherichia coll strain such as DH5 a 
strain, HB101 strain, JM109 strain or the like is 

20 transformed by introducing a recombinant plasmid obtained 
by the aforementioned method, and a resulting recombinant 
can be selected using the resistance for a drug such as 
tetracycline, ampicillin, kanamycin or the like as a 
marker. Transformation of a host cell, for example, when 

25 the host cell is E. coli, can be carried out by Hanahan' s 
method (Hanahan, D.J., Mol. Biol., 166, 557 - 580, 1983), 



namely, by adding the recombinant DNA to competent cells 
prepared in the coexistence of CaCl 2 and MgCl2 or RbCl. As 
a matter of course, commercially available competent cells 
can also be used. In this connection, in addition to a 
5 plasmid, a phage vector such as a lambda system can also be 
used as a vector. 

Regarding the method for selecting DNA of the novel 
protein of interest from the thus obtained transf ormants , 
various methods shown below can for example be employed. 

10 [0013] 

(i) A screening method which uses a synthetic 
oligonucleotide probe 

An oligonucleotide corresponding to whole or a part 
of the novel protein of the invention is synthesized (in 

15 this case, it may be either a nucleotide sequence derived 
by using the codon usage or a combination of two or more 
possible nucleotide sequences, and in the latter case, the 
number of their kinds can be reduced by including inosine) , 
this is hybridized as a probe (after labeling with 32 P or 

20 33 P) with a nitrocellulose filter or nylon filter on which 
DNA samples of the transformants are denatured and 
immobilized, and then the thus obtained positive strains 
are screened and selected. 
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(ii) A screening method which uses a probe prepared 
by polymerase chain reaction 

Oligonucleotides of a sense primer and an antisense 
primer corresponding to a part of the novel protein of the 
invention are synthesized and polymerase chain reaction 
(Saiki, R.K. et al. , Science, 239, 487 - 491, 1988) is 
carried out using these primers, thereby effecting 
amplification of a DNA fragment coding for whole or a part 
of the novel protein of interest. As the template DNA to 
be used, cDNA synthesized by reverse transcription reaction 
from mRNA of cells producing the novel protein or genomic 
DNA can be used. The thus prepared DNA fragment is labeled 
with 32 P or 33 P and used as the probe to carry out colony 
hybridization or plaque hybridization to select the clone 
of interest. 

(iii) A screening method in which the novel protein 
is produced by other animal cells 

A transformant is cultured to amplify a gene, an 
animal cell is transfected with the gene (in this case, the 
vector may be either an autonomously replicating plasmid 
comprising a transcription promoter region or a plasmid 
which can be integrated into chromosome of the animal 
cell) , and the protein encoded by the gene is produced in 
the extracellular moiety. By detecting the novel protein 
using an antibody specific for the novel protein of the 
invention, a strain comprising cDNA which encodes the novel 
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protein of interest is selected from the original 
transf ormants . 

(iv) A selection method which uses an antibody 
specific for the novel protein of the invention 

5 By integrating cDNA into an expression vector in 

advance, proteins are produced in culture supernatants , 
inside the cells or on the surface of cells of 
transf ormants , and the strain of interest is selected by 
detecting the novel protein producing strain of interest 
10 using an antibody specific for the novel protein of the 

invention and a secondary antibody against this antibody. 

(v) A method which uses a selective hybridization- 
translation system 

Samples of cDNA obtained from transf ormants are 
15 blotted on a nitrocellulose filter or the like, mRNA 

prepared from the novel protein producing cells of the 
invention is hybridized therewith, and then the mRNA 
hybridized to the cDNA is dissociated and recovered. The 
thus recovered mRNA samples are translated into proteins in 
20 a protein translation system, e.g., a system in which they 
are injected into oocyte of Xenopus or a cell free system 
such as rabbit reticulocyte lysate, wheat germ or the like. 
The strain of interest is selected by detecting it using an 
antibody against the novel protein of the invention. 
25 The method for collecting DNA coding for the novel 

protein of the invention from the thus obtained 
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transformant of interest can be carried out in accordance 
with gene manipulation experiment manuals such as of a 
known method (Sambrook, J. et al., "Molecular Cloning - A 
Laboratory Manual", Cold Spring Harbor Laboratory, NY, 
5 1989) and the like. For example, it can be achieved by 
separating a fraction corresponding to plasmid DNA from 
cells and then cutting out the cDNA region from the plasmid 
DNA. 

[0014] 

10 c) Third production method 

The novel protein gene of the invention can also be 
produced by connecting DNA fragments produced by a chemical 
synthesis method. Each DNA can be synthesized using a DNA 
synthesizing machine [e.g., Oligo 1000M DNA Synthesizer 
15 (mfd. by Beckman) , 394 DNA/RNA Synthesizer (mfd. by Applied 
Biosys terns) or the like] . 
[0015] 

d) Fourth production method 

The novel protein gene of the invention can also be 
20 produced based on the information on the novel protein, for 
example, by chemical synthesis of nucleic acids in 
accordance with a conventional method such as phosphite 
triester method (Hunkapiller , M. et al., Nature, 10, 105 - 
111, 1984) or the like. In this connection, codons for 
25 desired amino acids are well known, can be selected 

optionally and can be determined in accordance with a 
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conventional method (Crantham, R. et al. , Nucleic Acids 
Res., 9, r43 - r74, 1981), taking codon usage of the host 
to be used into consideration. In addition, partial 
modification of codons of these nucleotide sequences can be 
carried out in the usual way in accordance with the site 
specific mutagenesis (Mark, D.F. et al., Proc. Natl. Acad. 
Sci. USA, 81, 5662 - 5666, 1984) or the like which uses 
primers comprised of synthetic oligonucleotides which 
encode the desired modification. 
[0016] 

Determination of sequences of DNA obtained by the 
above methods a) to d) can be carried out for example by 
Maxam-Gilbert chemical modification method (Maxam, A.M. and 
Gilbert, W. , "Methods in Enzymology", 65, 499 - 559, 1980), 
dideoxy nucleotide chain termination method (Messing, J. 
and Vieira, J., Gene, 19, 269 - 276, 1982) and the like. 

The vector of the present invention, the hose cell of 
the present invention, and the novel protein of the present 
invention are obtainable by the following methods. 
[0017] 

2) Methods for the production of the vector of the 
invention, the host cell of the invention and the 
recombinant protein of the invention 

The thus isolated fragment containing the gene coding 
for the novel protein of the invention can be transformed 
into eucaryotic or procaryotic host cells by again 
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integrating it into an appropriate vector DNA. In 
addition, it is possible to express the gene in respective 
host cells by introducing an appropriate promoter and a 
sequence concerned in the gene expression into these 
5 vectors . 

For example , the eucaryotic host cells include cells 
of a vertebrate, an insect, yeast and the like, and COS 
cell as a monkey cell (Gluzman, Y . , Cell, 23, 175 - 182, 
1081) , a dihydrofolate reductase deficient strain of 

10 Chinese hamster ovary cell (CHO) (Urlaub, G. and Chasin, 
L.A., Proc. Natl. Acad. Sci. USA, 77, 4216 - 4220, 1980), 
human fetal kidney-derived HEK293 cell, 293-EBNA cell in 
which Epstein-Barr virus EBNA-1 gene is introduced into the 
same cell (mfd. by Invitrogen) and the like are frequently 

15 used as the vertebrate cells, though limited thereto. 

As the expression vector for vertebrate cells, a 
vector having a promoter, a UNA splicing site, a 
polyadenylylation site, a transcription termination 
sequence and the like generally positioned upstream of the 

20 gene to be expressed can be used, and it may further have a 
replication origin as occasion demands . Examples of the 
expression vector include pSV2dhfr having SV40 early 
promoter (Subramani, S. et al. , Mol. Cell. Biol., 1, 854 - 
864, 1981), pEF-BOS having human elongation factor promoter 

25 (Mizushima, S. and Nagata, S., Nucleic Acids Res., 18, 
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5322, 1990), pCEP4 having cytomegalovirus promoter (mfd. by 
Invitrogen) and the like, though not limited thereto. 

In the case of the use of COS cell as the host cell, 
an expression vector which has SV40 replication origin, can 
5 perform autonomous replication in COS cell and has a 

transcription promoter, a transcription termination signal 
and an RNA splicing site can be used, and its examples 
include pME18S (Maruyams, K. and Takebe, Y. , Med. Immunol., 
20, 27-32, 1990), pEF-BOS (Mizushima, S. and Nagata, S., 

10 Nucleic Acids Res. , 18, 5322, 1990), pCDM8 (Seed, B., 

Nature, 329, 840 - 842, 1987) and the like. The expression 
vector can be incorporated into COS cell by a DEAE-dextran 
method (Luthman, H. and Magnusson, G. , Nucleic Acids Res., 
11, 1295 - 1308, 1983), a calcium phosphate-DNA co- 

15 precipitation method (Graham, F.L. and van der Ed, A.J., 
Virology, 52, 456 - 457, 1973), a method which uses 
FuGENE™6 Transfection Reagent (mfd. by Boehringer 
Mannheim) , an electroporation method (Neumann, E. et al. , 
EMBO J., 1, 841 - 845, 1982) and the like, thus enabling to 

20 obtain a desired transformant cell. 
[0018] 

Also, when CHO cell is used as the host cell, a 
transformant cell which can stably produce the novel 
protein can be obtained by co-transf ecting a vector capable 
25 of expressing neo gene which functions as a G418 resistance 
marker, such as pRSVneo (Sambrook, J. et al., "Molecular 
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Cloning - A Laboratory Manual", Cold Spring Harbor 
Laboratory, NY, 1989), pSV2-neo (Southern, P.J. and Berg, 
P., J. Mol. Appl. Genet., 1, 327 - 341, 1982) or the like, 
together with an expression vector and then selecting a 
5 G418-resistant colony. Also, when 293-EBNA cell is used as 
the host cell, a desired transf ormant cell can be obtained 
using an expression vector which has Epstein-Barr virus 
replication origin and can perform autonomous replication 
in the 293-EBNA cell, such as pCEP4 (mfd. by Invitrogen) or 

10 the like. 

The thus obtained transf ormant cell of interest can 
be cultured in accordance with a conventional method, and 
the novel protein of the invention is produced in 
extracellular moiety by this culturing. As the medium to 

15 be used in the culturing, various conventionally used media 
can be optionally selected depending on the host cell 
employed. In the case of, for example, the COS cell, a 
medium such as a RPMI-1640 medium, Dulbecco's modified 
Eagle's minimum essential medium (DMEM) or the like which 

20 may be supplemented, as occasion demands, with a serum 

component such as fetal bovine serum (FBS) or the like may 
be used. Also, in the case of the 293-EBNA cell, a medium 
such as Dulbecco's modified Eagle's minimum essential 
medium (DMEM) or the like supplemented with a serum 

25 component such as fetal bovine serum (FBS) or the like and 
further supplemented with G418 may be used. 
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The novel protein of the invention thus produced in 
the extracellular moiety of the transformant cell can be 
separated and purified by various known separation 
techniques making use of physical characteristics, 
biochemical characteristics and the like of the novel 
protein. Illustrative examples of such techniques include 
treatment of a culture broth containing the novel protein 
with a usual protein precipitant, ultrafiltration, various 
types of liquid chromatography such as molecular sieve 
chromatography (gel filtration), adsorption chromatography, 
ion exchange chromatography, affinity chromatography, high 
performance liquid chromatography (HPLC) and the like, 
dialysis and combinations thereof. 

When the novel protein of the invention is expressed 
after its in frame fusion with a marker sequence, 
expression verification, purification and the like of the 
novel protein become possible. Examples of the marker 
sequence include FLAG epitope, Hexa-Histidine tag, 
Hemagglutinin tag, myc epitope and the like. Also, when a 
specific amino acid sequence recognizable by proteases 
such as enterokinase, factor Xa, thrombin and the like is 
inserted between a marker sequence and the novel protein, 
the marker sequence moiety can be cut off and removed by 
these proteases. 
25 [0019] 
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3) Method for detecting me tallopro tease activity of the 
novel protein of the invention 

Protease activity of the novel protein of the 
invention and a partial peptide thereof can be detected by 
5 mixing the novel protein of the invention or a partial 
peptide thereof with a substrate cited below in an 
appropriate buffer solution, allowing them to undergo the 
reaction and then detecting it by a method suited for 
respective substrate . 

10 As the substrate, a fluorescence- or radiation- 

labeled substrate, a synthetic substrate having a 
fluorophore, a quenching group or a chromophore, an 
unlabeled substrate and the like can be exemplified. 
Examples of the fluorescence- or radiation-labeled 

15 substrate include fluorescence- or radiation-labeled 

gelatin, collagen, synthetic peptides and the like, the 
synthetic substrate having a fluorophore includes Glt-Ala- 
Ala-Phe-MCA, Lys-MCA, Pyr-Arg-Thr-Lys-Arg-MCA and the like 
(Peptide Research Institute) , the synthetic substrate 

20 having a quenching group includes MOCAc-Arg-Pro-Lys-Pro- 
Lys-Pro-Tyr-Ala-Nva-Trp-Met (Dnp) ~NH 2 , MOCAc-Tyr-Val-Ala- 
Asp-Ala-Pro-Lys (Dpn) -NH 2 and the like (Peptide Research 
Institute) , the synthetic substrate having a chromophore 
includes Ala-pNA, Bz-Tyr-pNA, Pyr-Phe-Leu-pNa and the like 

25 (Peptide Research Institute) and the unlabeled substrate 
includes casein, collagen, fibronectin, aggrecan, gelatin 

30 



and the like proteins, insulin and the like physiologically 
active peptides, synthetic peptides and the like. The 
synthetic peptides include those which contain non-natural 
type amino acids too. 
5 For example, when a radiation-labeled substrate or a 

substrate having a fluorophore, a quenching group or a 
chromophore is used, the protease activity can be detected 
using a liquid scintillation counter, a fluorescence 
detector, a spectrophotometer or the like appropriate 
10 detector. When an unlabeled substrate is used, the 
protease activity can be detected by analyzing its 
degradation product using SDS-PAGE, HPLC, Zymography or the 
like. 

[0020] 

15 4) Method for detecting aggrecanase activity 

As the substrate for detecting aggrecanase activity, 
aggrecan purified from a cartilage or tissue of human or 
other animal, aggrecan obtained by genetic recombination, 
commercially available aggrecan (Seikagaku Kogyo) or a 

20 partial protein thereof can be used. The aggrecanase 

activity can be measured by allowing these substrates to 
react with a cell or tissue culture broth, cell or tissue 
extract or (partially) purified sample containing a 
protease to be tested, and then detecting a fragment cleaved 

25 off between Glu373-Ala374 . For the detection of the fragment 
cleaved off between Glu373-Ala374 , a method in which 



an N~terminal sequence or a C-terminal sequence of the 
digested fragment is determined in accordance with a 
conventional method, or more conveniently, ELISA (enzyme- 
linked immunosorbent assay) which uses an anti-neoepitope 
antibody (Hughes C.E. et al. , Biochem. J., 305, 799 - 804, 
1995) capable of specifically recognizing C-terminal NITGE 
and N-terminal ARGSV formed by the cutting between Glu373- 
Ala374, a western blotting or the like immunological method 
can be used. 
[0021] 

5) Method for preparing antibody which reacts with 
the novel protein of the invention 

The antibody which reacts with the novel protein of 
the invention, such as a polyclonal antibody or a 
monoclonal antibody, can be obtained by directly 
administering the novel protein or a fragment of the novel 
protein to various animals. It can also be obtained by a 
DNA vaccine method (Raz, E. et al. , Proc. Natl. Acad. Sci. 
USA, 91, 9519 - 9523, 1994; Donnelly, J.J. et al. , J. 
Infect. Dls., 173, 314 - 320, 1996) using a plasmid into 
which a gene coding for the novel protein of the invention 
is introduced. 

A polyclonal antibody is produced from a serum or egg 
of an animal such as rabbit, rat, goat, domestic fowl or 
the like which is sensitized by immunizing the animal with 
the novel protein or a fragment thereof emulsified in an 
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appropriate adjuvant such as complete Freund' s adjuvant by 
its peritoneal, subcutaneous, intravenous or the like 
injection. The thus produced polyclonal antibody can be 
separated and purified by usual protein isolation and 
5 purification techniques, and examples of the usual protein 
isolation and purification techniques include 
centrifugation, dialysis, salting out with ammonium sulfate 
and a chromatography using DEAE-cellulose, hydroxy apatite, 
protein A agarose or the like. 
10 A monoclonal antibody can be produced easily by those 

skilled in the art by the cell fusion method of Kohler and 
Milstein (Kohler, G. and Milstein, C. , Nature, 256, 495 - 
497, 1975) . 

That is, mouse is immunized by emulsifying the novel 
15 protein of the invention or a fragment thereof in an 

appropriate adjuvant such as complete Freund' s adjuvant and 
inoculating the emulsion several times at intervals of a 
few weeks by its peritoneal, subcutaneous, intravenous or 
the like injection. After the final immunization, spleen 
20 cells are taken out and fused with myeloma cells to prepare 
hybridomas . 

As the myeloma cells for obtaining hybridomas, a 
myeloma cell having a marker (e.g., hypoxanthine-guanine 
phosphoribosyl transferase deletion, thymidine kinase 
25 deletion or the like) , such as a mouse myeloma cell strain 
P3X63Ag8.Ul, is used. Also, polyethylene glycol is used as 
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the fusing agent. As the medium for preparing hybridomas , 
Eagle's minimum essential medium, Dulbecco' s minimum 
essential medium, RPMI1640 or the like usually used medium 
is used by optionally supplementing it with 10 to 30% fetal 
5 bovine serum. The fused strains are selected by the HAT 
selection method. Screening of hybridomas is carried out 
using culture supernatants by ELISA, immunological tissue 
staining or the like well known method or by the 
aforementioned screening method, and a clone of hybridoma 

10 which secretes the antibody of interest is selected. Also, 
monoclonal nature of the hybridoma is confirmed by 
repeating subcloning by limiting dilution. By culturing 
the thus obtained hybridoma in a medium for several days or 
in the abdominal cavity of a pristane-pretreated BALB/c 

15 mouse for 10 to 20 days, the antibody is produced in a 

purification -possible amount. The thus produced monoclonal 
antibody can be separated and purified from the culture 
supernatant or ascitic fluid by usual protein isolation 
purification techniques . 

20 Active antibody fragments containing a part of the 

antibody, such as F(ab') 2 , Fab, Fab' and Fv, can be 
obtained by digesting the thus separated and purified 
antibody with a proteolytic enzyme such as pepsin, papain 
or the like in the conventional way and then separating and 

25 purifying the fragments by usual protein isolation 
purification techniques. 
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Furthermore, it is possible to obtain the antibody 
which reacts with the novel protein of the invention as 
single chain Fv or Fab by the methods of Clackson et al. 
and Zebedee et al. (Clackson, T. et al., Nature, 352, 624 - 
5 628, 1991; Zebedee, S. et al. , Proc. Natl. Acad. Sci. USA, 
89, 3175 - 3179, 1992). In addition, it is possible to 
obtain a human antibody by immunizing a transgenic mouse in 
which a mouse antibody gene is replaced by a human antibody 
gene (Lonberg, N. et al . , Nature, 368, 856 - 859, 1994). 
10 [0022] 

6) Method for screening a substance capable of modifying 
metalloprotease activity of the novel protein of the 
invention 

The screening method of the invention contains a 
15 means in which, using the novel protein prepared at least 
by the production methods shown by the aforementioned 1) 
and 2) , a substance to be tested is added to a system for 
measuring an index of the modification of the 
metalloprotease of the novel protein in response to a 
20 biochemical characteristic of the novel protein, and the 

index is measured. In this case, examples of the measuring 
system include various well-known protease measuring 
systems (Biochemical Experimentation Methods 30 
"Proteolytic Enzymes I" edited by D. Tsuru and M. Funatsu 
25 published by Gakkai Shuppan Center, 1993, ibid. 31, 

"Proteolytic Enzymes I" Gakkai Shuppan Center, 1993) , and 



screening of the substance to be tested can be carried out 
by, in accordance with or applying the treating methods 
described in the documents. 

Regarding the substance to be tested, compounds or 
5 peptides which are generally known to have metalloprotease 
inhibition activity but their activities to modify 
metalloprotease activity of the novel protein are unclear, 
or various known compounds and peptides, compounds 
synthesized using combinatorial chemistry techniques 

10 (Terrett, N.K. et al., Tetrahedron, 51, 8135 - 8137, 1995) 
or general synthesis techniques and random peptides 
prepared by applying a phage display method (Felici, F. et 
al., J. Mol. Biol., 222, 301 - 310, 1991) can be used. In 
addition, extracts and culture supernatants of 

15 microorganisms, natural components derived from plants and 
marine organisms, animal tissue extracts and the like are 
also become objects of the screening. Alternatively, 
compounds or peptides prepared by chemically or 
biologically modifying the compounds or peptides selected 

20 by the screening method of the invention can also be used. 

For the screening of substances (compounds, peptides, 
antibodies and antibody fragments) capable of modifying 
protease activity of the novel protein of the invention , 
any substance which becomes the substrate of the novel 

25 protein of the invention or of a partial peptide thereof 
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can be used, and the substrates described in the 
aforementioned 3) are desirable. 
[0023] 

7) Method for screening a substance capable of modifying 
5 aggrecanase activity of the novel protein of the invention 
This can be screened by a similar method of the 
aggrecanase activity detection method described in 4) . 
Also, the ELISA or the like method exemplified in (Example 
10-2) can be used, in which added aggrecan, recombinant 

10 aggrecan, commercially available aggrecan or a partial 
protein thereof which disappears or decreases by its 
degradation when allowed to react with the novel protein of 
the invention is measured using an antibody which 
specifically recognizes polypeptides of the N-terminal side 

15 and C- terminal side moieties of the region cut off with 
aggrecanase. Also useful is a method in which the novel 
protein of the invention is allowed to react with a 
recombinant aggrecan in which FLAG tag is added to the N- 
terminal, and His tag to the C-terminal, as exemplified in 

20 (Example 7-1) , and the added recombinant aggrecan 

disappeared or decreased by its degradation is measured by 
ELISA or the like method using an anti-FLAG tag and anti- 
His tag antibodies. The tags in this case are not limited 
to FLAG tag and His tag, and the recombinant aggrecan is 

25 not limited to (Example 7-1) and may be a partial protein 
or modified protein of aggrecan which is cut off at the 
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aggrecanase cutting site by this protein. As the test 
substances to be used for the aggrecanase activity , 
substances similar to the test substances to be used for 
the metalloprotease activity of 6) are used. 
[0024] 

8) Method for detecting promoter activity 

As the method for detecting the promoter activity 
possessed by the sequences and partial sequences thereof 
shown in Examples , a method in which a reporter gene 
plasmid is used is convenient. The reporter gene means a 
gene coding for a protein which can be determined by usual 
means (e.g., measurement of enzyme activities and the like 
determination methods well known to those skilled in the 
art) , and chloramphenicol acetyl transferase, lucif erase, p- 
galactosidase and alkaline phosphatase genes are frequently 
used, though not limited thereto. Regarding a vector as 
the base for constructing a reporter gene plasmid, there is 
no limitation and commercially available plasmid vectors 
such as pGV-B2 (mfd. by Toyo Ink) , pSEAP2 -Basic (mfd. by 
Clontech) and the like can be used. By constructing a 
reporter gene plasmid in which the sequence is inserted in 
the forward direction into upstream of the reporter gene of 
these vectors and measuring amount of the reporter protein 
expressed in cells transformed with this plasmid, by a 
method suited for respective case, the presence and 
strength of the promoter activity of the sequence can be 
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known , and action of a substance to be tested upon this 
promoter activity can be detected by adding the substance 
to be tested to a culture broth of the transformed cells. 
[0025] 

5 9) Method for screening a substance capable of modifying 
the promoter activity of the invention 

For the screening of substances (compounds, peptides, 
antibodies and antibody fragments) which inhibit the 
promoter activity possessed by the sequence of the Sequence 

10 ID No. of the invention and a partial sequence thereof, a 
method similar to the promoter activity detection method 
shown in 8) can be used. Regarding the substance to be 
tested, compounds or peptides which are generally known to 
modify promoter activity but their activities to modify the 

15 promoter activity possessed by the sequences of SEQ ID 

NOs;24 to 31 and partial sequences thereof are unclear, or 
various known compounds and peptides, compounds synthesized 
using combinatorial chemistry techniques (Terrett, N.K. et 
al. , Tetrahedron, 51, 8135 - 8137, 1995) or general 

20 synthesis techniques and random peptides, antibodies and 
antibody fragments prepared by applying a phage display 
method (Felici, F. et al. , J. Mol. Biol., 222, 301 - 310, 
1991) can be used. In addition, extracts and culture 
supernatants of microorganisms, natural components derived 

25 from plants and marine organisms, animal tissue extracts 
and the like are also become the object of the screening. 
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Alternatively, compounds or peptides prepared by chemically 
or biologically modifying the compounds or peptides 
selected by the screening method of the invention can also 
be used. 

[0026] 

A medicament which comprises as the active ingredient 
the novel protein or a substance , among the substances 
(compounds, peptides, antibodies or antibody fragments) 
capable of significantly modifying activities of the novel 
protein selected by the aforementioned screening method, 
capable of significantly inhibiting the activity of a 
me tallopro tease having aggrecanase activity is included in 
the invention, and a proteoglycan degradation inhibitor is 
particularly desirable as the medicament. Examples of the 
substance which significantly inhibits activity of the 
novel protein include known compounds selected by the 
screening system shown in (Example 10-2) and disclosed in 
WO 90/05719, but not only medicaments containing these 
known compounds as the active ingredient but also all 
medicaments which contain substances capable of 
significantly inhibiting activity of the novel protein are 
included in the invention. In this connection, regarding 
the production method of the known compounds disclosed in 
WO 90/05719, they can be synthesized in accordance with the 
production method disclosed in this International 
Publication . 
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The pharmaceutical preparations which comprise as the 
active ingredient the novel protein of the invention or a 
substance (a compound, peptide, antibody or antibody 
fragment) which significantly inhibits activity of the 
5 novel protein can be prepared using carriers, fillers and 
other additives usually used for their preparation, in 
response to each type of the active ingredient. 

Examples of its administration include oral 
administration using tablets, pills, capsules, granules, 

10 fine subtilaes, powders, oral solutions and the like and 

parenteral administration using intravenous, intramuscular, 
intraarticular and the like injections, suppositories, 
percutaneous preparations, transmucosal preparations and 
the like. Particularly in the case of peptides which are 

15 digested in the stomach, parenteral administration such as 
intravenous injection or the like is desirable. 

In the solid composition for use in the oral 
administration according to the present invention, one or 
more active substances are mixed with at least one inert 

20 diluent such as lactose, mannitol, glucose, 

microcrystalline cellulose, hydroxypropylcellulose, starch, 
polyvinyl pyrrolidone, aluminum magnesium silicate or the 
like. In the usual way, the composition may contain other 
additives than the inert diluent, such as a lubricant, a 

25 disintegrating agent, a stabilizing agent, a solubilizing 
or solubilization assisting agent or the like. If 
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necessary, tablets or pills may be coated with a sugar or a 
film of a gastric or enteric substance. 
[0027] 

The liquid composition for oral administration 
5 includes emulsions, solutions, suspensions, syrups and 

elixirs and contains a generally used inert diluent such as 
purified water or ethanol. In addition to the inert 
diluent, this composition may contain auxiliary agents such 
as a moistening agent, a suspending agent, a sweetener, an 

10 aromatic agent and an antiseptic agent. 

The injections for parenteral administration includes 
aseptic aqueous or non-aqueous solutions, suspensions and 
emulsions . Examples of the diluent for use in the aqueous 
solutions and suspensions include distilled water for 

15 injection, physiological saline and the like. Examples of 
the diluent for use in the non-aqueous solutions and 
suspensions include propylene glycol, polyethylene glycol, 
plant oil (e.g., olive oil or the like), alcohol (e.g., 
ethanol) , Polysorbate 80 and the like. Such a composition 

20 may further contain a moistening agent, an emulsifying 
agent, a dispersing agent, a stabilizing agent, a 
solubilizing or solubilization assisting agent, an 
antiseptic and the like. These compositions are sterilized 
by filtration through a bacteria retaining filter, blending 

25 of a germicide or irradiation. Alternatively, they may be 
used by firstly making into sterile solid compositions and 
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dissolving them in sterile water or a sterile solvent for 
injection use prior to their use. 

The clinical dose is optionally decided by taking 
into consideration strength of activity of the active 
5 ingredient selected by the aforementioned screening method, 
symptoms, age, sex and the like of each patient to be 
treated . 

[0028] 
[Examples] 

10 The following describes the invention more 

illustratively . 

Unless otherwise noted, experiments were carried out 
in accordance with gene manipulation experiment manuals 
such as of a known method (Sambrook, J. et al. , "Molecular 
15 Cloning - A Laboratory Manual", Cold Spring Harbor 

Laboratory, NY, 1989) and the like, but the invention is 
not limited to the Examples. 
[0029] 

(Example 1) Discovery of partial sequence of a novel 
20 ADAMTS gene MDTS6 

A human brain cDNA library strictly fractionated by 
the size of insertion sequences was constructed as shown in 
a reference (Ohara O. et al. , DNA Res., 4, 53 - 59, 1997). 
Size distribution of cDNA fragments in these sub-libraries 
25 was from 3 kbp to 8 kbp. By deciphering 5'- and 3 '-end 

sequences of clones constituting this library, an in-house 

43 



EST data bank was constructed, A partial sequence of MDTS6 
was obtained from this, 
[0030] 

(Example 2) Determination of full-length ORF sequence 
5 of MDTS6 

By determining sequences of MDTS6 cDNA clones, a 
sequence of from the 832nd position to the 2853rd position 
of SEQ ID NO: 2 was obtained. The sequence of from the 1st 
position to the 831st position of SEQ ID NO: 2 was obtained 

10 by repeating RACE (Rapid Amplif ication of cDNA Ends) using 
human brain and human placenta Marathon-Ready™ cDNA 
manufactured by Clontech as the template and LA-Taq™ (mfd. 
by Takara Shuzo) as the DNA polymerase. As a result, it 
was revealed that the full-length MDTS6 was a novel protein 

15 composed of 950 amino acids as shown in SEQ ID NO:l. Its 
domain structure was composed of a secretion signal 
sequence, a pro region, a furin protease recognition 
sequence, a metalloprotease domain, a disintegrin domain, a 
thrombospondin type I repeat sequence (to be referred to as 

20 TSP-1 repeat sequence hereinafter) , a domain rich in Cys 
residue, an intermediate region and two TSP-1 repeat 
sequences, in order from the N-terminus, and it was a 
molecule belonging to the ADAMTS family (Kuno, K. et al., 
J. Biol. Chem. , 272, 556 - 562, 1997; Tang, B.L. et al. , 

25 FEBS Lett., 445, 223 - 225, 1999). 
[0031] 



(Example 3) Preparation of C- terminal FLAG addition 
type expression vector 

An EBNA1 expression unit-removed expression vector 
pCEP4d was constructed by digesting pCEP4 (mfd. by 
5 Invitrogen) with restriction enzymes Clal and NslI , blunt- 
ending the resulting fragments and then carrying out their 
autonomous ligation. This vector was digested with 
restriction enzymes Nhel and BamHI and extracted from an 
agarose gel to obtain a fragment of about 7 . 7 kbp , and a 

10 double strand of oligonucleotide obtained by annealing a 

nucleic acid shown by SEQ ID NO: 3 and a nucleic acid shown 
by SEQ ID NO: 4 was inserted into the fragment to select a 
clone having the planned sequence which was named pCEP4d- 
FLAG. Using this vector as the template and oligoDNA shown 

15 by SEQ ID NO: 5 and oligoDNA shown by SEQ ID NO: 6 as 
primers, PCR was carried out using PyroBest DNA 
polymerase . The thus generated DNA fragment of about 0 . 4 
kbp was digested with a restriction enzyme Spel and 
inserted into pCEP4d-FLAG (about 7.7 kbp) which had been 

20 digested with Xbal, and a clone in which Xbal , Nhel, NotI 
and BamHl recognition sequence cloning sites and FLAG tag 
were arranged in that order from the promoter as intended 
was selected, thereby completing pCEP4dE2 -FLAG . 
[0032] 
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(Example 4) Construction of MDTS6 truncated protein 
(MDTS6TSP1) expression plasmid 

A plasmid was constructed in the following manner for 
use in expressing a sequence of from the 1st position to 
5 the 583rd position of SEQ ID N0:1 (a moiety corresponding 
to a region containing the TSP1 repeat sequence from the N- 
terminus of MDTS6 (to be referred to as MDTS6TSP1 
hereinafter) ) as a protein in which FLAG was added to the 
C- terminus . 

10 Firstly, a gene of from the 1st position to the 

1749th position of SEQ ID NO: 2 was obtained by PCR. Using 
oligoDNA sequences represented by SEQ ID NO: 7 and SEQ ID 
NO: 8 as primers, human placenta Marathon -Ready™ cDNA (mfd. 
by Clontech) as the template and LA-Taq™ (mfd. by Takara 

15 Shuzo) as DNA polymerase, a cycle of 98°C for 10 seconds 

and 68°C for 2 minutes was repeated 10 times after heating 
at 94°C for 1 minutes. Using a DNA solution prepared by 50 
times -diluting this reaction solution as the template and 
using PyroBest™ DNA polymerase, PCR was carried out under 

20 a condition of 94°C for 2 minutes, 40 repetitions of a 

cycle of 98°C for 10 seconds, 66°C for 30 seconds and 74°C 
for 4 minutes and subsequent 72°C for 10 minutes. The thus 
generated fragment of interest in which Xbal recognition 
sequence and Kozak sequence were added to the 5' side, and 

25 No tl recognition sequence to the 3' side, was subcloned 
into PCR-Blunt to confirm the sequence and then digested 
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with restriction enzymes Xbal and NotI and inserted into 
the Xbal-NotI site of pCEP4dE2-FLAG to complete pCEP- 
MDTS6TSP1-FLAG . 
[0033] 

(Example 5) Construction of MDTS6 full-length protein 
expression plasmid 

A plasmid was constructed in the following manner for 
use in expressing a sequence of from the 1st position to 
the 950th position of SEQ ID NO:l as a protein in which 
FLAG was added to the C-terminus. 

Firstly, a gene of from the 1534th position to the 
2850th position of SEQ ID NO: 2 was obtained by PGR. 
Illustratively, using oligoDNA sequences represented by SEQ 
ID NO: 9 and SEQ ID NO: 10 as primers, the plasmid DNA of EST 
clone as the template and PyroBest™ DNA polymerase as DNA 
polymerase, a cycle of 98°C for 10 seconds, 50°C for 15 
seconds and 72°C for 2 minutes was repeated 20 times after 
heating at 94°C for 1 minutes, followed by 7 minutes of 
reaction at 72°C. The thus generated fragment of interest 
in which NotI recognition sequence was added to the 3' side 
was subcloned into PCR-Blunt to confirm the sequence and 
used as pCRB-MDTS6-3H . 

Making use of the presence of a BamHI recognition 
sequence in a sequence of from the 1566th position to the 
1571st position of SEQ ID NO: 2, pCEP-MDTS6TSPl-FLAG was 
digested with restriction enzymes Xbal and BamHI , and the 
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thus generated DNA fragment of about 1 . 6 kbp was connected 
to a DNA fragment of about 1.3 kbp generated by digesting 
pCRB-MDTS6-3H with BamHI and NotI and inserted into the 
Xbal-NotI site of pCEP4dE2 -FLAG to complete pCEP-MDTS6F- 
5 FLAG. 

[0034] 

(Example 6) Expression of MDTS6TSP1 by animal cell 

strain 

The expression plasmid prepared in Example 4 using 

10 pCEP4dE2 -FLAG as the backbone was introduced into HEK293- 
EBNA cell (mfd. by Invitrogen) using FuGENE™6 Transfection 
Reagent (mfd. by Boehringer Mannheim) in accordance with 
the attached instructions . After introduction of the 
plasmid , the presence of the protein of interest in a 

15 culture supernatant obtained by 1 to 2 days of culturing 

was confirmed by western blotting using an antibody against 
FLAG tag added to the C-terminus (a mouse anti-FLAG 
monoclonal antibody (M2 ; mfd. by Sigma)). That is, the 
culture supernatant was subjected to electrophoresis using 

20 SDS/10% to 20% acrylamide gel (mfd. by Daiichi Pure 

Chemicals) and then transferred on a PVDF membrane using a 
blotting apparatus . The PVDF membrane after the transfer 
was subjected to blocking by adding Block Ace (mfd. by 
Dainippon Pharmaceutical) and then allowed to react with 

25 the mouse anti-FLAG monoclonal antibody (M2; mfd. by Sigma) 
and a horseradish peroxidase-labeled rabbit anti-mouse IgG 
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polyclonal antibody (mfd. by Zymed or TAGO) in that order. 
Alternatively, after the blocking, it was allowed to react 
with biotinylated M2 antibody (mfd. by Sigma) and a 
streptoavidine-horseradish peroxidase conjugate (mfd. by 
5 Amersham) in that order. After the reaction, expression of 
the protein was confirmed using an ECL western blotting 
detection system (mfd. by Amersham Pharmacia) (Fig. 1) . 
Molecular weight of the expressed MDTS6TSP1 protein was 
smaller than the value calculated from the amino acid 

10 sequence by a factor of about 23 K. Making use of the fact 
that FLAG tag is added to the Oterminus of MDTS6TSP1 
protein expressed by the HEK293-EBNA cell as described in 
the foregoing, MDTS6TSP1 protein was affinity-purified by 
the method of Example 7-1 and then transferred on a PVDF 

15 membrane, and the N-terminal sequence of MDTS6TSP1 protein 
stained with Ponceau S was determined by analyzing with 
Type 494 Peptide Sequencer manufactured by ABI . As a 
result, it was shown that it starts from the 213th position 
Phe of SEQ ID NO:l and, similar to the case of other ADAMTS 

20 molecules, becomes mature protein (from 213th position to 
583rd position of SEQ ID NO:l) by being cleaved at the 
furin protease recognition sequence existing between the 
pro region and metalloprotease domain. Also, the MDTS6 
full-length protein was obtained in the same manner as the 

25 case of the above MDTS6TSP1 protein expression using the 
expression plasmid obtained in Example 5, and similar to 
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the case of MDTS6TSP1, it was shown that it becomes mature 
protein (from 213th position to 950th position of SEQ ID 
NO:l) by being cleaved at the furin protease recognition 
sequence existing between the pro region and 
5 metal lopro tease domain. 
[0035] 

(Example 7) Detection of enzyme activity of MDTS6TSP1 
protein expressed in animal cell host 

(Example 7-1) Preparation of recombinant aggrecan 

10 G1G2 

Using oligoDNA sequences represented by SEQ ID NO: 11 
and SEQ ID NO: 12 synthesized based on the reported gene 
sequence of human aggrecan (Doege K. et al., Biochem. Soc. 
Trans., 18, 200 - 202, 1990) as primers, human placenta 

15 Marathon-Ready™ cDNA as the template and PyroBest™ DNA 
polymerase as DNA polymerase, the reaction of 94°C for 1 
minute, 40 repetitions of a cycle of 98°C for 10 seconds 
and 68°C for 2 minutes, and subsequent 68°C for 7 minutes 
was carried out. The thus generated DNA fragment was 

20 digested with a restriction enzyme BamHI and inserted into 
the BamHI site of pCEP-SigFla, thereby completing an 
expression plasmid pCEP-rAgg for use in the expression of a 
protein in which FLAG tag is added to the N-terminus, and 
His tag to the C-terminus, of the globular domain 1 (61)- 

25 globular domain 2 (G2) of human aggrecan. The pCEP-SigFla 
is an expression vector which is prepared by introducing 
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double strand of the oligoDNA sequences represented by SEQ 
ID NO: 13 and SEQ ID NO: 14 into the Hindlll-Xhol site of 
pCEP4d and has the influenza virus hemagglutinin secretion 
signal sequence described in the report (Guan X-M. et al. , 
J. Biol. Chem. , 267, 21995 - 21998, 1992), FLAG tag and 
BamHI recognition sequence in that order downstream of the 
promoter . 

The plasmid pCEP-rAgg was introduced into HEK2 93-EBNA 
cell which was subsequently cultured for 3 to 7 days, 
thereby effecting expression and production of the protein 
of interest. Purification of the protein of interest from 
the culture supernatant was carried out by an affinity 
purification making use of the addition of FLAG tag to the 
N-terminus . That is , the culture supernatant was applied 
to M2 -agarose (mfd. by Sigma) packed in a column, washed 
with 20 mM Tris-HCl (pH 7.4)/150 mM NaCl (to be referred to 
as TBS hereinafter), eluted and fractionated with 0.1 M 
Gly-HCl (pH 3.0) and immediately neutralized with 1 M Tris- 
HCl (pH 8.0) . 
[0036] 

(Example 7-2) Detection of recombinant aggrecan G1G2 
degrading activity of MDTS6 truncated protein 

In Example 6, the medium 12 to 16 hours after 
introduction of the expression plasmid was replaced by a 
serum-free medium, and the culturing was continued for 32 
to 36 hours to recover the culture supernatant. This 
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culture supernatant was mixed with the recombinant aggrecan 
prepared in the foregoing, and the mixture was allowed to 
undergo the reaction at 37°C overnight, subjected to SDS- 
PAGE , transferred on a PVDF membrane and blocked by the 
5 method described in Example 6 and then allowed to react 
with an anti-Hisx6 polyclonal antibody (sc-803; mfd. by 
Santa Cruz Biotechnology) and a horseradish peroxidase- 
labeled goat anti-rabbit IgG polyclonal antibody (mfd. by 
MBL) in that order. After the reaction, the recombinant 

10 aggrecan was detected using an ECL western blotting system 
(mfd. by Amersham Pharmacia) . As a result, degraded 
fragment of the recombinant aggrecan, which was not found 
in the control in which only the expression plasmid was 
introduced, was detected (Fig. 2) . 

15 [0037] 

(Example 7-3) Analysis by anti-aggrecanase neoepitope 
antibody 

Aggrecanase is a metalloprotease which selectively 
cleaves aggrecan at the site between Glu373-Ala374 . An 

20 antibody capable of recognizing a C-side neoepitope 

generated by this cleavage was prepared in accordance with 
a usual method by repeating immunization of mouse with a 
conjugate of the synthetic peptide having Ala-Arg-Gly-Ser- 
Val-Val-Leu-Thr-Ala-Lys-Cys and KLH, 5 times. A PVDF 

25 membrane after transfer and blocking carried out in the 
same manner as in Example 7-2 was allowed to react with 
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this antibody, allowed to react with a peroxidase-labeled 
goat anti-mouse IgG polyclonal antibody (mfd. by Tago) and 
then detected using an ECL western blotting detection 
system (mfd. by Amersham Pharmacia) . As a result, the 
degraded product of recombinant aggrecan generated by MDTS6 
reacted with the anti-aggrecanase neoepitope antibody, and 
molecular weight of the detected band is consistent with 
the molecular weight of the degraded product detected in 
Example 7-2 (Fig. 3) . The same result was obtained by the 
BC-3 antibody which recognizes aggrecanase neoepitope 
(Hughes C.E. et al. , Biochemical J. , 305, 799 - 804, 1995). 
[0038] 

(Example 8) Expression induction of MDTS6 iaRNA by 

IL-1 

It is known that a mouse cell strain ATDC5 is 
differentiated into a chondrocyte-like cell by insulin 
treatment (Atsumi T. et al., Cell Differ. Dev., 30, 109 - 
116, 1990) . The ATDC5 cells were inoculated in 4 x 
10 5 /well portions into an I type collagen-coated 6 well 
plate (mfd. by Asahi Technoglass) and cultured for 2 days 
using DMEM/ HamF12 (l:l)/5% FCS medium, the medium was 
changed to DMEM/HamFl2 (l:l)/5% FCS medium containing 
insulin (final concentration 30 ng/ml) and 50 |xg/ml of L- 
ascorbic acid and the culturing was continued for 5 days, 
and then the resulting cells were treated for 0, 1, 2, 4 or 
8 hours by adding IL-lp (final concentration 5 ng/ml) . 
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Total RNA was prepared from each of the treated groups 
using ISOGEN (mfd. by Nippon Gene) , and RT-PCR was carried 
out using 1 \ig portion thereof as the template and using 
BcaBEST™ RNA PCR Kit (mfd. by Takara Shuzo) . The reverse 
5 transcription reaction was carried out using Oligo dT- 
Adaptor Primer as the primer in accordance with the 
attached instructions , and PCR was carried out using 
oligoDNA sequences represented by SEQ ID NO: 15 and SEQ ID 
NO: 16 as primers, which had been synthesized based on the 

10 3' non- translation region of MDTS6, by the reaction of 94°C 
for 2 minutes, 40 repetitions of a cycle of 94°C for 30 
seconds, 60°C for 30 seconds and 72°C for 30 seconds, and 
subsequent 72°C for 7 minutes. The reaction solution was 
subjected to electrophoresis with 1% agarose, and densities 

15 of the thus generated bands of about 0.3 kbp were compared. 
As a result, it was found that expression of the MDTS6 mRNA 
is transiently induced by IL-1 (Fig. 4) . 
[0039] 

(Example 9) Degradation of natural type aggrecan by 

20 MDTS6 

(Example 9-1) Expression of various full-length MDTS6 
proteins and their recombinant aggrecan G1G2 degrading 
activity 

The expression plasmid constructed using pCEP4dE2- 
25 FLAG as the backbone was introduced into HEK293-EBNA cell 
(mfd. by Invitrogen) using FuGENE™6 Transfection Reagent 
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(mfd. by Boehringer Mannheim) in accordance with the 
attached instructions. After introduction of the plasmid, 
the resulting cells were cultured overnight and washed with 
PBS buffer, and then the medium was changed to a serum-free 
5 medium and the culturing was continued for 2 to 3 days. 
The resulting culture broth was centrifuged at 9,000 rpm 
for 10 minutes , and the supernatant was used as the enzyme 
source of MDTS6. In this case, in addition to the 
expression plasmids described in Example 4 and Example 5, 

10 expression plasmids for three proteins, namely a protein in 
which the polypeptide of Gly-Ser-Ala-Ala-Ala-Asp-Tyr-Lys- 
Asp-Asp-Asp-Asp-Lys was added to the C-terminus of the 
amino acids of from the 1st position to the 447th position 
of SEQ ID NO:l, of the amino acids of from the 1st position 

15 to the 518th position of SEQ ID NO:l, of the amino acids of 
from the 1st position to the 685th position of SEQ ID NO:l, 
of the amino acids of from the 1st position to the 841st 
position of SEQ, ID NO:l, or of the amino acids of from the 
1st position to the 896th position of SEQ ID NO:l, were 

20 designed as expression plasmids. For example, the 

expression plasmid for a protein in which the polypeptide 
of Gly-Ser-Ala-Ala-Ala-Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys was 
added to the C- terminus of the amino acids of from the 1st 
position to the 685th position of SEQ ID NO:l (hereinafter, 

25 referred to as MDTS6Cys) , which was used in the following 

Examples, was constructed by amplifying a gene by PCR using 
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the full-length protein expression plasmid constructed in 
Example 5 as the template and the oligoDNA sequences 
represented by SEQ ID NO: 7 and SEQ ID NO: 17 as primers and 
using PyroBest DNA polymerase, digesting the gene with 
5 restriction enzymes Xbal and NotI, and then inserting the 
resulting fragment into the Xbal-Notl site of pCEP4dE2- 
FLAG. 

The aggrecanase activity of the various MDTS6 
proteins above were evaluated by the method of Example 7-3. 

10 As a result, no aggrecanase activity was found in the 

protein having amino acids of from the 1st position to the 
447st position of SEQ ID NO:l and in the protein having 
amino acids of from the 1st position to the 518st position 
of SEQ ID NO:l. That is, it was revealed that the first 

15 TSP-1 repeat sequence counting from the N- terminus is 

essential for exerting the aggrecanase activity of MDTS6 . 
Further , it was revealed that the ratio of the protein 
secreted to the supernatant of the culture was low in the 
case of the latter two proteins and the full-length 

2 0 protein . 

[0040] 

(Example 9-2) Degradation of natural type aggrecan 
A 90 \il portion of the MDTS6 enzyme solution prepared 
in Example 9-1 was mixed with a solution of 10 |xg natural 
25 type aggrecan (mfd. by Seikagaku Kogyo) /10 \il TBS in a test 
tube and allowed to undergo the reaction at 37°C overnight. 
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This reaction product was dried up using SpeedVac and then 
dissolved in 100 ^1 of 10 mM Tris-acetate buffer (pH 7.6) 
containing 0.06 unit of Chondroitinase ABC (mfd. by 
Seikagaku Kogyo), 0.024 unit of keratanase I (mfd. by 
5 Seikagaku Kogyo), 0.0004 unit of keratanase II (mfd. by 
Seikagaku Kogyo) , 5 of PMSF and 10 mM of EDTA, and the 
solution was allowed to undergo the reaction at 37°C 
overnight. A portion of this reaction solution was 
subjected to SDS-PAGE and then the product was detected 

10 using the mouse an ti -aggrecanase neoepitope antibody as 

shown in Example 7-3. In this case, the peroxidase-labeled 
goat anti -mouse IgG polyclonal antibody used was a 
preparation manufactured by Biosource. The same result was 
obtained by the BC-3 antibody which recognizes aggrecanase 

15 neoepitope (Hughes C.E. et al. , Biochemical J. , 305, 799 - 
804, 1995) . 

As a result, a band of 80 to 90 KDa was detected in 
the case of MDTS6Cys in addition to a band of about 150 
KDa. This degradation pattern is consistent with the 

20 pattern of main molecules (all generated by aggrecanase 

degradation) found in the joint synovial fluids of patients 
of joint diseases including OA and RA (Sandy J.D. et al., 
J. Clin. Invest., 89, 1512 - 1516, 1992; Lohmander L.S. et 
al., Arthritis Rheum., 36, 1214 - 1222, 1993) and also is 

25 consistent with the pattern of main molecules having 

aggrecanase neoepitope which are generated after 12 to 24 



hours of treatment with IL-1 and retinoic acid in an 
explant culture system of human knee joint cartilage 
(Little C.B. et al. , Biochemical J". , 344, 61 - 68, 1999) 
(Fig. 5) . 
5 [0041] 

(Example 10) Screening system of substances capable 
of modifying the aggrecanase activity 

(Example 10-1) Preparation of MDTS6Cys and substrate 

It was confirmed using the western blotting method 
10 shown in Example 9-2 that the recombinant aggrecan G1G2 and 
the natural type aggrecan are cleaved off at the 
aggrecanase site by MDTS6Cys without purification but as 
the culture supernatant prepared by the method of Example 
9-1. Also, the cleavage at the "aggrecanase site" was 
15 observed when the culturing in Example 9-1 was continued 

with the 10% FBS-containing medium without changing to the 
serum-free medium. Accordingly , the recombinant aggrecan 
61G2 prepared in Example 7-1 was used as the substrate. 

[0042] 

20 (Example 10-2) Screening system 

Though the screening can be carried out by the 
western blotting-aided method shown in Example 7-2 using 
the rAgg-Gl-G2 or natural type aggrecan as the substrate, 
the following ELISA system was constructed for screening 

25 more larger number of compounds to be tested. 
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An MDTS6Cys culture supernatant, the recombinant 
aggrecan G1G2 and a compound to be tested were mixed and 
allowed to undergo the reaction at 37°C for several hours, 
the resulting product was adhered to a 96 well plate (Nunc- 
5 Immuno™ Plate MaxiSorp™ Surface # 439454; mfd. by Nunc) , 
blocked with 1% BSA/TBS solution and then allowed to react 
with a mouse anti-aggrecanase neoepitope antibody and an 
HRP-anti-mouse IgG antibody conjugate (mfd. by Biosource) 
in that order, and then the detection was carried out using 

10 TMB Peroxidase EIA Substrate Kit (mfd. by Bio-Rad) to 

calculate the aggrecanase activity inhibiting strength of 
the compound to be tested using the coloring inhibition as 
a marker. Also, as a modified method thereof, rAggGl-G2 
was adhered to the 96 well plate (the same as above) and 

15 blocked with 1% BSA/TBS solution in advance and then an 
MDTS6Cys culture supernatant and a compound to be tested 
were added thereto and allowed to undergo the reaction at 
37°C for several hours, the resulting product was allowed 
to react with a mouse anti-aggrecanase neoepitope antibody 

20 and an HRP-anti-mouse IgG antibody conjugate (mfd. by 
Biosource) in that order, and then the detection was 
carried out using TMB Peroxidase EIA Substrate Kit (mfd. by 
Bio-Rad) to calculate the aggrecanase activity inhibiting 
strength of the compound to be tested using the coloring 

25 inhibition as a marker. By using this screening system, it 
was able to select N-a- [3- (N-hydroxycarbamoyl) -2~isobutyl- 



3-mercaptopropionyl]-N-o-dimethyltyrosineamide (Compound A) 

and N-ct- [3- (N -hydroxy carbamoyl) -2-isobutyl-4- 
mercaptopropionyl]-N-methylphenylalanineamide (Compound B) . 

[0043] 
5 (Example 11) 

(Example 11-1) Preparation of rabbit kn ee joint 
cartilage primary culture cells 

After killing a rabbit (Japanese white species, male, 
1.0 to 1.5 kg) under excess anesthesia, a knee joint was 
10 excised and the cartilage layer on the joint surface was 
removed and finely cut using a surgical knife. The cut 
pieces were treated with trypsin-EDTA (0.25%-l mM; mfd. by 
GIBCO-BRL) at 37°C for 1 hour and then centrifuged at 1,500 
rpm for 5 minutes, and the resulting precipitate was washed 
15 with DMEM. This was treated with collagenase A (0.15%; 
Boehringer-Mannheim) /DMEM at 37°C for 3 to 4 hours, and 
then a nylon mesh filter (100 \m, mfd. by Falcon) -passed 
fraction was centrifuged at 1,500 rpm for 5 minutes to 
effect precipitation of cartilage cells. The cells were 
20 thoroughly washed with DMEM/ 10% FBS medium, suspended in 
DMEM/10% FBS medium to a density of 2 x 10 5 cells/ml and 
then inoculated in 200 ul/well portions into an I type 
collagen-coated 96 well plate (mfd. by Asahi Technoglass) . 
Three days thereafter, the medium was changed to 200 ul of 
25 DMEM/10% FBS medium containing 50 ug/ml of ascorbic acid 
(ascorbic acid medium hereinafter) , and the culturing was 
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continued for 3 days . When an I type collagen-coated 6 
well plate (Asahi Technoglass) was used, the cell 
suspension was inoculated in 6 ml/well portions and 
cultured by carrying out the same medium exchange. These 
5 cells were used in the following test. 
[0044] 

(Example 11-2) Proteoglycan degradation of rabbit 
knee joint cartilage primary culture cells 

The rabbit knee joint cartilage primary culture cells 

10 of 96 well plate described in Example 11-1 were cultured 
for 2 days using 200 jil of the ascorbic acid medium 
supplemented with 10 jiCi/ml in final concentration of 
Na 2 35 S0 4 and labeled therewith, washed 3 times with 200 \il 
of the ascorbic acid medium and then cultured for 1 day 

15 using 200 jal of the ascorbic acid medium. After 

stimulation with IL-lp or all-trans retinoic acid and 
subsequent 0, 24, and 48 hours of culturing, the culture 
supernatants were recovered in 20 |il portions, and the 
radioactivity was measured using Top Count (mfd. by 

20 Packard) . As a result, increase in the radioactivity, 

namely release of proteoglycan, was observed by 0.01 to 10 
ng/ml of IL-ip stimulation, and increase in the 
concentration-dependent and strong radioactivity, namely 
release of proteoglycan, was observed by 0 . 1 to 10 jiM of 

25 all- trans retinoic acid stimulation (Pig. 6) . 
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[0045] 

(Example 11-3) Induction of MDTS6 mRNA expression 
After 3 days of culturing of the rabbit knee joint 
cartilage primary culture cells of 6 well plate described 
5 in Example 11-1 by changing the medium to ascorbic acid 
medium, 10 ng/ml of IL-lp or 10 |oM of all-trans retinoic 
acid was added thereto, and total RNA samples 2 and 6 hours 
thereafter were prepared using ISOGEN (mfd. by Nippon Gene) 
in accordance with the attached instructions. Each of the 

10 samples was treated with DNase I (mfd. by Nippon Gene) , 

subjected to phenol/chloroform treatment and then recovered 
by ethanol precipitation, and the thus purified total RNA 
was dissolved in DEPC-treated sterile water. Using random 
hexamers as primers, 1 \ig of this total RNA was subjected 

15 to reverse transcription reaction and RNase H treatment 
using Thermoscript™ RT-PCR System (mfd. by GIBCO-BRL, 
catalog number 11146-016) in accordance with the attached 
instructions, and the product was diluted 10 times with 
sterile water and used as a cDNA sample. Using 5 ^1 of 

20 each of the thus obtained cDNA samples as the template and 
the oligoDNA sequences represented by SEQ ID NO: 18 and SEQ 
ID NO: 19 as primers, PCR was carried out under a condition 
of 94°C for 2 minutes, 45 repetitions of a cycle of 94°C 
for 30 seconds, 65°C for 30 seconds and 72°C for 30 

25 seconds, and subsequent 72°C for 10 minutes. The reaction 
products were subjected to 2% agarose electrophoresis, and 
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densities of the generated DNA fragments were compared. As 
a result, expression of the MDTS6 mRNA was induced by IL-ip 
and all-trans retinoic acid, and the expression strength 
correlated with the degree of proteoglycan degradation 
5 described in Example 11-2 (Fig. 7) . 
[0046] 

(Example 12) Inhibition of proteoglycan degradation 
in rabbit knee joint cartilage primary culture cells by 
substances which inhibit the aggrecanase activity 

10 Each of the compounds A and B selected by the 

screening system of Example 10-2 was added to the 
proteoglycan degradation system of rabbit knee joint 
cartilage primary culture cells just before the stimulation 
with 10 of all- trans retinoic acid, and their 

15 proteoglycan degradation inhibitory activities were 

examined. As a result, the compounds A and B showed the 
inhibition of proteoglycan degradation in a concentration - 
dependent manner (Fig. 8) . On the other hand, the 
proteoglycan degradation inhibition action was not observed 

20 by compounds which have the same hydroxamic acid backbone 
but show a weak aggrecanase activity inhibition, even at a 
concentration of 100 nM. 
[0047] 

(Example 13) Analysis of MDTS6 promoter region DNA 
25 sequence 
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A DNA fragment corresponding to the promoter region 
of MDTS6 was amplified using PCR from GenomeWalker DNA Sea 
I Libraries (Genome Walker™ Kits, CLONTECH catalog number 
Kl 803-1) . OligoDNA sequences of the adapter primers AP-1 
5 (SEQ ID NO: 20) and AP-2 (SEQ ID NO: 21) attached to the kit 
were used as forward primers, and the oligoDNA sequences of 
SEQ ID NO: 22 and SEQ ID NO: 23 as reverse primers. The 
illustrative method was as described in the instructions 
attached to the kit, but TAKARA LA Taq (TAKARA LA Taq™, 

10 catalog number RR002A) was used in the PCR. The first PCR 
was carried out using the oligoDNA sequences of SEQ ID 
NO: 20 and SEQ ID NO: 22 as primers under a condition of 7 
repetitions of a cycle of 98°C for 5 seconds and 72°C for 3 
minutes, 32 repetitions of a cycle of 98°C for 5 seconds 

15 and 67°C for 3 minutes, and 67°C for 4 minutes. The second 
PCR was carried out under the same conditions using 5 y.1 of 
a solution prepared by diluting the reaction solution of 
the first reaction 50 times with TE buffer (10 mM Tris-HCl, 
1 mM EDTA, pH 8.0) as the template, and the oligoDNA 

20 sequences of SEQ ID NO: 21 and SEQ ID NO: 23 as primers. 

When the thus amplified DNA fragment of about 3 . 7 kbp was 
directly subjected to sequence analysis by dideoxy 
terminator method using ABI377 DNA Sequencer (Applied 
Biosystems Inc.), DNA sequences of about 2.2 kbp, 0.36 kbp 

25 and 0.8 kbp divided by two un-decipherable gaps were found. 
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Next, in order to decipher sequences of these two gap 
moieties which were unable to decipher by the direct 
analysis of the PCR-amplif ied DNA fragment, this DNA 
fragment was subcloned and the DNA nucleotide sequence was 
5 determined. In this connection, P ZErO™-2 vector (Zero 
Background/Kan Cloning Kit, mfd. by Invitrogen, catalog 
number K2600-01) was used as the cloning vector, and the 
subcloning was carried out in accordance with the attached 
instructions . 

10 A plasmid prepared by inserting the above DNA 

fragment into the KpnI-XhoI site of a reporter plasmid P GV- 
B2 (mfd. by Toyo Ink) was introduced into HEK293 cell using 
FuGene-6, and the lucif erase activity after 28 or 48 hours 
of culturing under usual culturing conditions was measured 
15 using PicaGene coloring kit (mfd. by Toyo Ink, catalog 
number PGK-L100) . In this case, the measured value was 
normalized by the activity value of p-gal expressed by a 
simultaneously introduced p-gal expression plasmid pCHHO 
(Amersham Pharmacia Biotech, catalog number 27-4508-01) . 
20 The P-gal activity was measured using Galacto-Light Plus 
Kit (mfd. by TROPIX, catalog number BL300P) . As a result, 
distinct increase in the luciferase activity which cannot 
be found in the original plasmid P GV-B2 was observed. This 
result indicates that the promoter activity is present in 
25 the above DNA fragment. 
[0048] 
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[Effects of the Invention] 

The novel metalloprotease obtained by the invention 
is characterized in that, since it has a protease activity, 
it can be used as a medicament and for the screening of a 
5 substance (a compound, a peptide, an antibody or an 
antibody fragment) to be used in a medicament which 
significantly modifies activity of the protease. In this 
case, regarding the medicinal use of the protease and a 
substance which significantly modifies activity of the 

10 protease, it is effective for diseases which are caused by 
acceleration, reduction, degeneration and the like 
abnormalities of the activity of the novel protease or in 
which the abnormalities are expressed to cause 
complications, such as cancer, arthritis, osteoarthritis 

15 and the like. Also, the metalloprotease of the invention 
is characterized in that, since it has aggrecanase 
activity, it can be used for the screening of the 
metalloprotease having aggrecanase activity and a substance 
(a compound, a peptide, an antibody or an antibody 

20 fragment) which significantly modifies the metalloprotease 
having aggrecanase activity. In this case, as the medical 
use of the metalloprotease having aggrecanase activity and 
a substance which significantly modifies the 
metalloprotease having aggrecanase activity, it is 

25 suggested that it is effective in preventing and treating 
diseases which are caused by acceleration, reduction, 
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degeneration and the like abnormalities of the activity of 
the novel protease or in which the abnormalities are 
expressed to cause complications, particularly joint 
diseases as diseases which show proteoglycan degradation, 
5 most particularly osteoarthritis. 
[Brief Description of the Drawings] 

[Fig. 1] Fig. 1 is a photograph showing a result 
of the expression of MDTS6TSP1 in an animal cell strain 
using an ECL western blotting detection system, obtained in 

10 Example 6. 

[Fig. 2] Fig. 2 is a photograph showing a result 
of the detection of the activity of MDTS6TSP1 to degrade a 
recombinant aggrecan G1G2 using an ECL western blotting 
detection system, obtained in Example 7-2 . 

15 [Fig. 3] Fig. 3 is a photograph showing a result 

of the analysts of a recombinant aggrecan 6162 degraded 
with MDTS6TSP1, by an anti-aggrecanase neoepitope antibody, 
using a western blotting detection system, obtained in 
Example 7-3. 

2 0 [Fig. 4] Fig. 4 is an electrophoresis pattern 

photograph showing a result of the examination of MDTS6 
inRNA expression induction by IL-ip, obtained in Example 8. 

[Fig. 5] Fig. 5 is a photograph showing a result 
of the detection of degradation of natural type aggrecan by 

25 MDTS6 protein, by an anti-aggrecanase neoepitope antibody, 
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using a western blotting detection system, obtained in 
Example 9-2 . 

[Fig. 6] Fig. 6 is a graph showing a result of the 
detection of release of proteoglycan from rabbit knee joint 
5 primary culture cells by all-trans retinoic acid and IL-lp, 
obtained in Example 11-2. 

[Fig. 7] Fig. 7 is an electrophoresis pattern 
photograph showing a result of the analysis of changes in 
gene expression of MDTS6 by RT-PCR when rabbit knee joint 
10 primary culture cells are treated with all- trans retinoic 
acid and IL-lf}, obtained in Example 11-3. 

[Fig. 8] Fig. 8 is a graph showing that 
degradation and release of proteoglycan from rabbit knee 
joint primary culture cells by all-trans retinoic acid are 
15 inhibited by the compound A and compound B, obtained in 
Example 12. 
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[Designation of Document] Abstract 
[Abstract] 

[Problem] To provide a recombinant protein which is 
necessary for isolating and identifying genes coding for 
5 novel ADAMTS proteins having a markedly high possibility as 
drug creation target molecules and for screening substances 
capable of modifying their activities. 

[Means for Resolution] A gene coding for a novel protein 
classified into the ADAMTS family was isolated, its 

10 complete ORP sequence was determined and the protein was 
expressed. A vector containing this gene, a host cell 
containing this vector, a method for producing the protein 
using this host cell and a method for screening this 
protein and compounds, peptides and antibodies capable of 

15 modifying activity of the protein were established. 
[Selected Drawing] None 
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[Sequence Listing] 

SEQUENCE LISTING 



<110> Yamanouchi Pharmaceutical Co., Ltd. 
<120> Novel aggrecanase 

<130> WP32802941 
<160> 31 

<170> Patentln Ver. 2. 0 

<210> 1 

<211> 950 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Leu Leu Leu Gly He Leu Thr Leu Ala Phe Ala Gly Arg Thr Ala 

15 10 15 

Gly Gly Phe Glu Pro Glu Arg Glu Val Val Val Pro He Arg Leu Asp 

20 25 30 

Pro Asp He Asn Gly Arg Arg Tyr Tyr Trp Arg Gly Pro Glu Asp Ser 

35 40 45 

Gly Asp Gin Gly Leu He Phe Gin He Thr Ala Phe Gin Glu Asp Phe 

50 55 60 

Tyr Leu His Leu Thr Pro Asp Ala Gin Phe Leu Ala Pro Ala Phe Ser 
65 70 75 80 



Thr Glu His Leu Gly Val Pro Leu Gin Gly Leu Thr Gly Gly Ser Ser 

85 90 95 

Asp Leu Arg Arg Cys Phe Tyr Ser Gly Asp Val Asn Ala Glu Pro Asp 

100 105 110 

Ser Phe Ala Ala Val Ser Leu Cys Gly Gly Leu Arg Gly Ala Phe Gly 

115 120 125 

Tyr Arg Gly Ala Glu Tyr Val lie Ser Pro Leu Pro Asn Ala Ser Ala 

130 135 140 

Pro Ala Ala Gin Arg Asn Ser Gin Gly Ala His Leu Leu Gin Arg Arg 
145 150 155 160 

Gly Val Pro Gly Gly Pro Ser Gly Asp Pro Thr Ser Arg Cys Gly Val 

165 170 175 

Ala Ser Gly Trp Asn Pro Ala He Leu Arg Ala Leu Asp Pro Tyr Lys 

180 185 190 

Pro Arg Arg Ala Gly Phe Gly Glu Ser Arg Ser Arg Arg Arg Ser Gly 

195 200 205 

Arg Ala Lys Arg Phe Val Ser lie Pro Arg Tyr Val Glu Thr Leu Val 

210 215 220 

Val Ala Asp Glu Ser Met Val Lys Phe His Gly Ala Asp Leu Glu His 
225 230 235 240 

Tyr Leu Leu Thr Leu Leu Ala Thr Ala Ala Arg Leu Tyr Arg His Pro 

245 250 255 

Ser He Leu Asn Pro He Asn He Val Val Val Lys Val Leu Leu Leu 

260 265 270 

Arg Asp Arg Asp Ser Gly Pro Lys Val Thr Gly Asn Ala Ala Leu Thr 

275 280 285 

Leu Arg Asn Phe Cys Ala Trp Gin Lys Lys Leu Asn Lys Val Ser Asp 

290 295 300 

Lys His Pro Glu Tyr Trp Asp Thr Ala He Leu Phe Thr Arg Gin Asp 



305 310 315 320 

Leu Cys Gly Ala Thr Thr Cys Asp Thr Leu Gly Met Ala Asp Val Gly 

325 330 335 

Thr Met Cys Asp Pro Lys Arg Ser Cys Ser Val He Glu Asp Asp Gly 

340 345 350 

Leu Pro Ser Ala Phe Thr Thr Ala His Glu Leu Gly His Val Phe Asn 

355 360 365 

Met Pro His Asp Asn Val Lys Val Cys Glu Glu Val Phe Gly Lys Leu 

370 375 380 

Arg Ala Asn His Met Met Ser Pro Thr Leu He Gin He Asp Arg Ala 
385 390 395 400 

Asn Pro Trp Ser Ala Cys Ser Ala Ala He He Thr Asp Phe Leu Asp 

405 410 415 

Ser Gly His Gly Asp Cys Leu Leu Asp Gin Pro Ser Lys Pro He Ser 

420 425 430 

Leu Pro Glu Asp Leu Pro Gly Ala Ser Tyr Thr Leu Ser Gin Gin Cys 

435 440 445 

Glu Leu Ala Phe Gly Val Gly Ser Lys Pro Cys Pro Tyr Met Gin Tyr 

450 455 460 

Cys Thr Lys Leu Trp Cys Thr Gly Lys Ala Lys Gly Gin Met Val Cys 
465 470 475 480 

Gin Thr Arg His Phe Pro Trp Ala Asp Gly Thr Ser Cys Gly Glu Gly 

485 490 495 

Lys Leu Cys Leu Lys Gly Ala Cys Val Glu Arg His Asn Leu Asn Lys 

500 505 510 

His Arg Val Asp Gly Ser Trp Ala Lys Trp Asp Pro Tyr Gly Pro Cys 

515 520 525 

Ser Arg Thr Cys Gly Gly Gly Val Gin Leu Ala Arg Arg Gin Cys Thr 
530 535 540 



Asn Pro Thr Pro Ala Asn Gly Gly Lys Tyr Cys Glu Gly Val Arg Val 
545 550 555 560 

Lys Tyr Arg Ser Cys Asn Leu Glu Pro Cys Pro Ser Ser Ala Ser Gly 

565 570 575 

Lys Ser Phe Arg Glu Glu Gin Cys Glu Ala Phe Asn Gly Tyr Asn His 

580 585 590 

Ser Thr Asn Arg Leu Thr Leu Ala Val Ala Trp Val Pro Lys Tyr Ser 

595 600 605 

Gly Val Ser Pro Arg Asp Lys Cys Lys Leu He Cys Arg Ala Asn Gly 

610 615 620 

Thr Gly Tyr Phe Tyr Val Leu Ala Pro Lys Val Val Asp Gly Thr Leu 
625 630 635 640 

Cys Ser Pro Asp Ser Thr Ser Val Cys Val Gin Gly Lys Cys He Lys 

645 650 655 

Ala Gly Cys Asp Gly Asn Leu Gly Ser Lys Lys Arg Phe Asp Lys Cys 

660 665 670 

Gly Val Cys Gly Gly Asp Asn Lys Ser Cys Lys Lys Val Thr Gly Leu 

675 680 685 

Phe Thr Lys Pro Met His Gly Tyr Asn Phe Val Val Ala He Pro Ala 

690 695 700 

Gly Ala Ser Ser He Asp lie Arg Gin Arg Gly Tyr Lys Gly Leu lie 
705 710 715 720 

Gly Asp Asp Asn Tyr Leu Ala Leu Lys Asn Ser Gin Gly Lys Tyr Leu 

725 730 735 

Leu Asn Gly His Phe Val Val Ser Ala Val Glu Arg Asp Leu Val Val 

740 745 750 

Lys Gly Ser Leu Leu Arg Tyr Ser Gly Thr Gly Thr Ala Val Glu Ser 

755 760 765 

Leu Gin Ala Ser Arg Pro lie Leu Glu Pro Leu Thr Val Glu Val Leu 



770 775 780 

Ser Val Gly Lys Met Thr Pro Pro Arg Val Arg Tyr Ser Phe Tyr Leu 
785 790 795 800 

Pro Lys Glu Pro Arg Glu Asp Lys Ser Ser His Pro Lys Asp Pro Arg 

805 810 815 

Gly Pro Ser Val Leu His Asn Ser Val Leu Ser Leu Ser Asn Gin Val 

820 825 830 

Glu Gin Pro Asp Asp Arg Pro Pro Ala Arg Trp Val Ala Gly Ser Trp 

835 840 845 

Gly Pro Cys Ser Ala Ser Cys Gly Ser Gly Leu Gin Lys Arg Ala Val 

850 855 860 

Asp Cys Arg Gly Ser Ala Gly Gin Arg Thr Val Pro Ala Cys Asp Ala 
865 870 875 880 

Ala His Arg Pro Val Glu Thr Gin Ala Cys Gly Glu Pro Cys Pro Thr 

885 890 895 

Trp Glu Leu Ser Ala Trp Ser Pro Cys Ser Lys Ser Cys Gly Arg Gly 

900 905 910 

Phe Gin Arg Arg Ser Leu Lys Cys Val Gly His Gly Gly Arg Leu Leu 

915 920 925 

Ala Arg Asp Gin Cys Asn Leu His Arg Lys Pro Gin Glu Leu Asp Phe 

930 935 940 

Cys Val Leu Arg Pro Cys 
945 950 



<210> 2 

<211> 2853 

<212> DNA 

<213> Homo sapiens 



<400> 2 






atgcttttgc 


tgggcatcct 


aaccctggct 


ccagagcggg 


aggtagtcgt 


tcccatccga 


tactggcggg 


gtcccgagga 


etceggggat 


caggaggact 


tttacctaca 


cctgacgccg 


actgagcatc 


tgggcgtccc 


cctccagggg 


tgcttctatt 


CtE£££aC£t 

oooo o 




PPPPPPCtCP 


ppppapcptt 


tppctacppa 


aatpctapcp 

CiCi ege LUgVjg 


PPP C PPPPPP 
uge/Uggegge/ 


ppappppaap 


pptpt t pp per 


gp PPPCC t t r 


PPPa PaPPPP 
t-ggagai-'eee 


7\7\CC P P PC Pa 


tPPtaPPPPP 


P P 1" fir an p p p t 


aptppta ppp 

Cig ei/g tuguu 




t PPPPPP PPP 


papapppt 


t pptrpr ppa 

i*55 eogC/ggci 


^ 0 a O eOCiCl 


t a t p t pr t pa 

I'd e e/ vgu 1/ gCl 


p CP t pp t p0r 


eK-*e/ggeggOg 




t rpt t pt ppt 


^dagg e ge eg 


ptcapppppa 


a t ppppppp t 


^a p pp t" pp pp 


aaaptpaertp 


acaaecaccc 


cpaptactpp 

^ o*-* g ec*e egg 


ctetcteeae 


ccaccaccte 


teacaccctp 


cccaagagaa 


BCtCCtctgt 


cattgaggac 


cacgagctgg 


gecaegtett 


caacatgccc 


tttgggaagc 


tccgagccaa 


ccacatgatg 


aacccctggt 


cagcctgcag 


tgctgccatc 


gactgcctcc 


tggaccaacc 


cagcaagccc 


agctacaccc 


tgagecagea 


gtgcgagctg 


tacatgeagt 


actgcaccaa 


gctgtggtgc 


cagacccgcc 


acttcccctg 


ggccgatggc 


aaaggggect 


gcgtggagag 


acacaacctc 


aaatgggatc 


cctatggccc 


ctgctcgcgc 


aggcagtgea 


ccaaccccac 


ccctgccaac 



ttcgccgggc 


gaaccgctgg 


aggctctgag 


60 


ctggacccgg 


acattaaegg 


ccgccgctac 


120 


cagggactca 


tttttcagat 


cacagcattt 


180 


gatgetcagt 


tcttggctcc 


cgccttctcc 


240 


ctcacceeec 


gctcttcaga 


cctcceacgc 


300 


pp ppapt ppt 


tppptpct pt 

vegu eg*-* eg w 


eaecctet 

gugee i»5 ego 


360 


PPP PPP P3 pt 
ggegOL/gClg L 


at pt pa 1 1 a P" 

aegee-aeeag 


P PP PP t PPPP 


420 


ageedgggug 


mmcct t p t 
eaedve e tv e 




480 


orptp+pcrpt 
dUC LO ttgt e 




u ^^g&g^ ^gg 




tcapQacrppcrp 
tdeadstegt 




p 1 1 p pppp pic 

U Luggggdg 




ddgug e t Lug 


t crt p t 9k\ p p p 
eg eu u<3 leue 


peppf ap pt p 
Sugg e<3e.g eg 


660 


pt n & 0i~ f C C 


<3t/ggL/gL/gga 


ppt pppap^t 


7?0 


pfyar-tptap/" 1 


crp rat p pp ?} P" 


pa t PP tPfiflP 
i<ee leoae 


780 


Pt t P 1 1 PCTfl £7 

eeeeeecgag 


P t P Ct {TP P t P 


CPPPC-CC£\r\P 


840 


a a p 1 1 p 1" pt p 
ciai> e ee eg eg 


pp t PP'P/^P'fla 


pa a pp t pa a p 


900 


cap apt grra 


tcctcttaac 

wee ui> e tectu 


uaggeaggao 


960 


ppcatpgctp 


ateteectac 


catetetRac 


1020 


eateeecttc 


catcagcctt 


caccactgcc 


1080 


catgacaatg 


teaaaetctg 


tgaggaggtg 


1140 


tccccgaccc 


tcatccagat 


cgaccgtgcc 


1200 


atcaccgact 


tcctggacag 


egggcaeggt 


1260 


atctccctgc 


ccgaggatct 


gccgggcgcc 


1320 


gcttttggcg 


tgggctccaa 


gccctgtcct 


1380 


acegggaagg 


ccaagggaca 


gatgg xgtgc 


1 *t4v 


accagctgtg 


gcgagggcaa 


gctctgcctc 


1500 


aacaagcaca 


gggtggatgg 


ttcctgggcc 


1560 


acatgtggtg 


ggggcgtgca 


getggecagg 


1620 


gggggcaagt 


actgegaggg 


agtgagggtg 


1680 



aaataccgat 


cctgcaacct 


Kgagccctsc 

ooo ^o 


gagRaecaet 

ooo o o 


£tea££cttt 

o ooo ** ^ 


caac££ctac 




tcrcccaaeta 








O V «-> O liU l(g l>g 






P at P t at at P 

LU tg 




gtp +■ r» p a c» oro c» 
gO LUL/ddgdd 


gdgd L tLgaU 


do*-' Lg^dagd 


agg^gac tgg 


at It L Ltatu 


fT O O t" ^> ^> r\ /T 

gCCdLCCCCg 


caggcgcctc 


r-v r> /*t 4~ /> rr r*\ /•> 

aagcaxegae 


gggga xgaca 


actacctggc 


tctgaagaac 


txcgtggtgt 


cggcggtgga 


gcgggacctg 


ggcacgggca 


cagcggtgga 


gagectgeag 


gtggaggtcc 


tctccgtggg 


gaagatgaca 


cccaaagagc 


ctcgggagga 


caagtcctct 


ttgcacaaca 


gcgtcctcag 


cctctccaac 


gcacgctggg 


tggctggcag 


ctgggggccg 


aagcgggcgg 


tggactgccg 


gggctccgcc 


gcccatcggc 


ccgtggagac 


acaagcctgc 


gcctggtcac 


cctgctccaa 


gagctgegge 


gtgggccacg 


gaggccggct 


gctggcccgg 


gagctggact 


tctgcgtcct 


gaggccgtgc 



cccagctcag cctccggaaa gagcttcegg 1740 
aaccacagca ccaaccggct cactctcgcc 1800 
tctccccggg acaagtgcaa gctcatctgc 1860 
ctggcaccca aggtggtgga cggcacgctg 1920 
caaggcaagt gcatcaaggc tggctgtgat 1980 
aagtgtgggg tgtgtggggg agacaataag 2040 
aagcccatgc atggctacaa tttcgtggtg 2100 
atccgccagc gcggttacaa agggctgatc 2160 
agecaaggea agtacctget caaegggcat 2220 
gtggtgaagg geagtctget geggtacage 2280 
gcttcccggc ccatcctgga gccgctgacc 2340 
ccgccccggg tccgctactc cttctatctg 2400 
catcccaagg acccccgggg accctctgtc 2460 
caggtggagc ageeggaega caggccccct 2520 
tgctccgcga getgeggcag tggcctgcag 2580 
gggcagegea cggtccctgc ctgtgatgca 2640 
ggggagcect gccccacctg ggagctcagc 2700 
eggggattte agaggegetc actcaagtgt 2760 
gaecagtgea acttgcaccg caagccccag 2820 
tga 2853 



<210> 3 

<211> 50 

<212> DNA 

<213> Homo sapiens 



<400> 3 

ctagcgcggc cgcaggatcc gactacaagg acgacgatga caaatgataa 50 



<210> 4 
<211> 50 
<212> DNA 

<213> Homo sapiens 
<400> 4 

gatcttatca tttgtcatcg tcgtccttgt agtcggatcc tgcggccgcg 

<210> 5 

<211> 34 

<212> DNA 

<213> Homo sapiens 

<4O0> 5 

ggactagtct agaagctggg taccagctgc tagc 

<210> 6 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<4O0> 6 

ggactagtgt cgaccggtca tggctgcgc 

<210> 7 

<211> 42 

<212> DNA 

<213> Homo sapiens 



<400> 7 



gtgtctagag ccatgctttt gctgggcatc ctaaccctgg ct 



42 



<210> 8 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 8 

agagcggccg cctgctcctc ccggaagctc tttccggagg c 41 

<210> 9 

<211> 27 

<212> DNA 

<213> Homo sapiens 



<400> 9 



aagcacaggg tggatggttc ctgggcc 



<210> 



10 



<211> 



37 



<212> 



DNA 



<213> 



Homo sapiens 



<400> 



10 



gcgcggccgc gcacggcctc aggacgcaga agtccag 



37 



<210> 11 
<211> 37 



<212> DNA 

<213> Homo sapiens 



<400> 11 

taggatcctt gtagaaactt cagaccatga caactcg 37 

<210> 12 
<211> 59 
<212> DNA 

<213> Homo sapiens 
<400> 12 

atggatcctc aatggtgatg gtgatgatga ccgaagcaga aggcatggtg ccgggacag 59 

<210> 13 
<211> 97 
<212> DNA 

<213> Homo sapiens 
<400> 13 

agcttgccac catgaagacg atcatcgccc tgagctacat cttctgcctg gtattcgccg 60 
actacaagga cgatgatgac aaggggatcc actagtc 97 

<210> 14 
<211> 97 
<212> DNA 

<213> Homo sapiens 



<400> 14 



tcgagactag tggatcccct tgtcatcatc gtccttgtag tcggcgaata ccaggcagaa 60 
gatgtagctc agggcgatga tcgtcttcat ggtggca 97 

<210> 15 
<211> 30 
<212> DNA 

<213> Homo sapiens 



<400> 15 

acctcagcag ccagctccct tgtatacaca 

<210> 16 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 16 

cttgaggggg atggaccaat acagctttgg 

<210> 17 

<211> 38 

<212> DNA 

<213> Homo sapiens 



30 



30 



<400> 17 

agagcggccg ctccagtcac cttcttgcag ctcttatt 



<210> 18 
<211> 27 



<212> DNA 

<213> Homo sapiens 
<400> 18 

gcggacgagt ccatggtcaa gttccac 

<210> 19 
<211> 27 
<212> DNA 

<213> Homo sapiens 
<400> 19 

ttctgccagg cgcagaagtt gcgcagc 

<210> 20 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 20 

gtaatacgac tcactatagg gc 

<210> 21 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<400> 21 

actatagggc acgcgtggt 



<210> 22 
<211> 30 
<212> DNA 

<213> Homo sapiens 



<400> 22 

actgagcatc cggcgtcagg tgtaggtaaa 

<210> 23 
<211> 30 
<212> DNA 

<213> Homo sapiens 
<400> 23 

agtcctcctg aaatgctgtg atctgaaaaa 



30 



30 



<210> 24 

<211> 3473 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> promoter 
<222> 

<400> 24 

ttgcacagct aagatctggt ggaggcatgc acacagggcc ctctgaccat gggctctaaa 60 
tcactgtact atgttccctt ccataggcct caatcagtca tgtaatattt gacctggtcg 120 



ttatcttagg 


tattatctag 


accacagatt 


ttggatgcag 


ttctctggct 


gaagacctct 


180 


gagctaggat 


aaccccttct 


cttttgacag 


acgagtcaga 


gaatcagatc 


agtgatagaa 


240 


gtggagtgcc 


aatcctgagt 


atcacctcta 


ctcaagtgct 


caacatatcc 


ctagatcctc 


300 


aattccctgg 


caaaagtgat 


tggatggaac 


cacaggcttc 


caagagggga 


cagtcaagca 


360 


ttaaatacga 


gaatgcacat 


ataactcttg 


gtgcaatgtt 


tagcacatac 


taagcctgca 


420 


atacatgcta 


atccctttga 


gcaaatccac 


atggccagtt 


tctgtgctca 


ggggtgagaa 


480 


tagctgggct 


gtgattgggg 


cagggggagc 


actaagtggg 


agggacttcc 


tgtctcaggt 


540 


ccctgccatc 


ttgactgaca 


tgctgcagcc 


cttgccaaaa 


cccatgggtc 


agaatgaaag 


600 


taaagtgccg 


ttgaaaacct 


tgcaatccac 


ctttaaaact 


gccgggtgta 


gtaaaacaat 


660 


tgcttgcccc 


aaataaatga 


cttatcattg 


ctgttggttg 


tctgcgtttc 


tctttaatta 


720 


taggccctct 


ttgaacgctc 


aaacacacag 


ggcctttgta 


agcttgaact 


ccctgtctca 


780 


cacacagtcc 


tcccataccc 


atacactctc 


tttcatttgc 


agagtataaa 


cacccatctc 


840 


tcactcattc 


acataatgaa 


tttcagctcc 


ttgtgtccca 


atcaaggaga 


ggcctcactg 


900 


gaattatggg 


catctgagcc 


atcttcatgt 


tccaaggccc 


cagggggcgc 


ttccaagagt 


960 


ggatccttta 


tggggagaag 


ataatgggca 


aaaagtgctc 


ttcactgatg 


gaccagtccc 


1020 


agccttttct 


ctccttggac 


aatagagttc 


ttcccttgaa 


cagccacttc 


cctaaaaaaa 


1080 


attccaaaat 


tctcccacat 


catccccttt 


atgcttaaaa 


tcatcacaca 


ctcccttctt 


1140 


tgtcctcccc 


tcttgcaaac 


tcaactcaga 


gccctttggc 


tccagaaaga 


ttttctaggt 


1200 


atcaggagag 


agtagcaaag 


cctccctcct 


ctccttgcct 


ttctcccttg 


tcagagaaag 


1260 


aagttgattc 


tgcggagagg 


taagaaggat 


cttgaggtct 


agagcctgaa 


aaactccttg 


1320 


ggctgttctc 


caaactagat 


gggaacataa 


ggtgcgattg 


catcttctcc 


agctgatact 


1380 


cactcggcct 


cctatgccag 


tccccagtcc 


agggtttggt 


caagggtcaa 


atgagataat 


1440 


ttcatggagg 


aagcctggcc 


cgatttttct 


actgtttgct 


ggaagacagc 


ctcttcctct 


1500 


tgtaactgca 


gccccagaac 


ctgatctcca 


catccctgcc 


aggcaggtag 


ctgtgtacaa 


1560 


gggctcatct 


tcctgccccc 


aaccccagct 


ctgatttgct 


tattcaggtg 


gtgtaaatac 


1620 


ttctaccagg 


acctatttca 


agccattgtg 


atgtccctga 


ctggggagat 


gcagggcagc 


1680 


acaccattta 


atatttccct 


cacatttcca 


ccccattctg 


cactcttttc 


tgggagttgc 


1740 


tgtctcagag 


ggttggcggt 


tctggtggct 


caagaccata 


agtaattatc 


aaatacttag 


1800 


gaagcgacgg 


gttttgagta 


tttattacct 


tttaaaaatg 


tactttgtgg 


ctaggcatgg 


1860 



tggctcacgc ctgtagtccc cgcaccggga 
gagttcaaga ccagcctggg caacacggcg 
acacacacac acacacacac acacacacac 
tggtgtcgtg tgtctgtggt cgcagttact 
aggtaggagg atcacccgag gtcggtagtt 
ctgtctttac taaaaataca aaattagctg 
tacttgggag gctgagacag gagaatggct 
tgagatcgcg tcattgcact ccagcctggg 
aagaaatata tatatatatg tgtgtgtgtg 
tatgtgtata tatatgtatg tgtgtgtgtg 
gtaagtgtgt ttagtttaat ttttaataat 
cctgagaact gaagagtttg ccccagccca 
tggggctagg cggtcttggg ttccagaggg 
gcaataggac atttacgcgg gggggggggg 
tctgtgatta agaaacagca gggatgttgc 
ggtcagtgaa gggtcttcat tcctagctgc 
agacccagag agtgaacgct gtccgcgggg 
ggcataacca agagcaaaac gcaaactgag 
cgcctcctgc ctcagctcaa tgccactccc 
cgggaccgag ttctccggtg gcccctggag 
ggggagtgag gggaggggcg ctgactgggc 
tcgcccagcg cctggcttgc tgcgctgcga 
ccttccgcta gttctcggct gcaaatcttc 
gctcccaggg cgcgctttgc ttggaaaggc 
cgctcgccgc cctgggagga gtctccctcc 
gtcccgtagc gttggcggtt ccagagtgcg 
gagcccggcc cagccccttc ccacagcgcg 



ggccgaggtg ggtggattgc ttgagctcag 1920 
aaacccagtc tctaccaaaa atacacacac 1980 
acacacacac acacacaaat tggcctagcg 2040 
caggagacca aggtaggagg taggaaacca 2100 
cgagaccagc ctgaccaaca tggagaaacc 2160 
ggcgtggtgg tgcatgcctg taattccagc 2220 
tgaacccgga aggcggagtt tgcggtgagc 2280 
caacaagagc aaaactccgt ctcaaaaaaa 2340 
tgtgtgtgtg tgtatgtata tatatatatg 2400 
tgcatatata tatatacact ttgtttaatt 2460 
gtccgtgatt aacagctggc tggcaagatt 2520 
tccagcacac catgggccca gggcagacct 2580 
ctcccatgcc cctgtcctat tgctcttctg 2640 
tggttcttga ttctgggtct tttaggggac 2700 
aacagcaggg atgaggtggg cctggggacg 2760 
tgacctgatc tgccctgaga taaaagacta 2820 
gcagaagcga gtgaggcgtc gggacagtgg 2880 
acttcagcgc cggtttctcg ggccagccca 2940 
tccccgccaa gtggctctcc gctctggagg 3000 
gctccggcag cgagctctgg gaggctggga 3060 
cgtccaaaga ggagggggcc tttaataggc 3120 
gtggctgcgg ttgcgagaag ccgcccggca 3180 
gtccttgcac ttgacagcga ttgtacttaa 3240 
acaggtagga agcgcgggct gccgggtgca 3300 
cttggctctc ctttctggga actgccggct 3360 
ggctgcacgg agaccgcggc agcggccgga 3420 
gcggtgcgct gcccggcgcc atg 3473 



<210> 25 



<211> 3467 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> promoter 
<222> 



<400> 25 






ttgcacagct 


aagatctggt 


ggaggcatgc 


tcactgtact 


atgttccctt 


ccataggcct 


ttatcttagg 


tattatctag 


accacagatt 


gagctaggat 


aaccccttct 


cttttgacag 


gtggagtgcc 


aatcctgagt 


atcacctcta 


aattccctgg 


caaaagtgat 


tggatggaac 


ttaaatacga 


gaatgcacat 


ataactcttg 


atacatgcta 


atccctttga 


gcaaatccac 


tagctgggct 


gtgattgggg 


cagggggagc 


ccctgccatc 


ttgactgaca 


tgctgcagcc 


taaagtgccg 


ttgaaaacct 


tgcaatccac 


tgcttgcccc 


aaataaatga 


cttatcattg 


taggccctct 


ttgaacgctc 


aaacacacag 


cacacagtcc 


tcccataccc 


atacactctc 


tcactcattc 


acataatgaa 


tttcagctcc 


gaattatggg 


catctgagcc 


atcttcatgt 


ggatccttta 


tggggagaag 


ataatgggca 


agccttttct 


ctccttggac 


aatagagttc 


attccaaaat 


tctcccacat 


catccccttt 


tgtcctcccc 


tcttgcaaac 


tcaactcaga 



acacagggcc 


ctctgaccat 


gggctctaaa 


60 


caatcagtca 


tgtaatattt 


gacctggtcg 


120 


ttggatgcag 


ttctctggct 


gaagacctct 


180 


acgagtcaga 


gaatcagatc 


agtgatagaa 


240 


ctcaagtgct 


caacatatcc 


ctagatcctc 


300 


cacaggcttc 


caagagggga 


cagtcaagca 


360 


gtgcaatgtt 


tagcacatac 


taagcctgca 


420 


atggccagtt 


tctgtgctca 


ggggtgagaa 


480 


actaagtggg 


agggacttcc 


tgtctcaggt 


540 


cttgccaaaa 


cccatgggtc 


agaatgaaag 


600 


ctttaaaact 


gccgggtgta 


gtaaaacaat 


660 


ctgttggttg 


tctgcgtttc 


tctttaatta 


720 


ggcctttgta 


agcttgaact 


ccctgtctca 


780 


tttcatttgc 


agagtataaa 


cacccatctc 


840 


ttgtgtccca 


atcaaggaga 


ggcctcactg 


900 


tccaaggccc 


cagggggcgc 


ttccaagagt 


960 


aaaagtgctc 


ttcactgatg 


gaccagtccc 


1020 


ttcccttgaa 


cagccacttc 


cctaaaaaaa 


1080 


atgcttaaaa 


tcatcacaca 


ctcccttctt 


1140 


gccctttggc 


tccagaaaga 


ttttctaggt 


1200 



atcaggagag 


agtagcaaag 


cctccctcct 


ctccttgcct 


ttctcccttg 


tcagagaaag 


1260 


aagttgattc 


tgcggagagg 


taagaaggat 


cttgaggtct 


agagcctgaa 


aaactccttg 


1320 


ggctgttctc 


caaactagat 


gggaacataa 


ggtgcgattg 


catcttctcc 


agctgatact 


1380 


cactcggcct 


cctatgccag 


tccccagtcc 


agggtttggt 


caagggtcaa 


atgagataat 


1440 


ttcatggagg 


aagcctggcc 


cgatttttct 


actgtttgct 


ggaagacagc 


ctcttcctct 


1500 


tgtaactgca 


gccccagaac 


ctgatctcca 


catccctgcc 


aggcaggtag 


ctgtgtacaa 


1560 


gggctcatct 


tcctgccccc 


aaccccagct 


ctgatttgct 


tattcaggtg 


gtgtaaatac 


1620 


ttctaccagg 


acctatttca 


agccattgtg 


atgtccctga 


ctggggagat 


gcagggcagc 


1680 


acaccattta 


atatttccct 


cacatttcca 


ccccattctg 


cactcttttc 


tgggagttgc 


1740 


tgtctcagag 


ggttggcggt 


tctggtggct 


caagaccata 


agtaattatc 


aaatacttag 


1800 


gaagcgacgg 


gttttgagta 


tttattacct 


tttaaaaatg 


tactttgtgg 


ctaggcatgg 


1860 


tggctcacgc 


ctgtagtccc 


cgcaccggga 


ggccgaggtg 


ggtggattgc 


ttgagctcag 


1920 


gagttcaaga 


ccagcctggg 


caacacggcg 


aaacccagtc 


tctaccaaaa 


atacacacac 


1980 


acacacacac 


acacacacac 


acacacacac 


acacacacac 


aaattggcct 


agcgtggtgt 


2040 


cgtgtgtctg 


tggtcgcagt 


tactcaggag 


accaaggtag 


gaggtaggaa 


accaaggtag 


2100 


gaggatcacc 


cgaggtcggt 


agttcgagac 


cagcctgacc 


aacatggaga 


aaccctgtct 


2160 


ttactaaaaa 


tacaaaatta 


gctgggcgtg 


gtggtgcatg 


cctgtaattc 


cagctacttg 


2220 


ggaggctgag 


acaggagaat 


ggcttgaacc 


cggaaggcgg 


agtttgcggt 


gagctgagat 


2280 


cgcgtcattg 


cactccagcc 


tgggcaacaa 


gagcaaaact 


ccgtctcaaa 


aaaaaagaaa 


2340 


tatatatata 


tatgtgtgtg 


tgtgtgtgtg 


tgtgtgtatg 


tatatatata 


tatgtatgtg 


2400 


tatatatatg 


tatgtgtgtg 


tgtgtgcata 


tatatatata 


cactttgttt 


aattgtaagt 


2460 


gtgtttagtt 


taatttttaa 


taatgtccgt 


gattaacagc 


tggctggcaa 


gattcctgag 


2520 


aactgaagag 


tttgccccag 


cccatccagc 


acaccatggg 


cccagggcag 


accttggggc 


2580 


taggcggtct 


tgggttccag 


agggctccca 


tgcccctgtc 


ctattgctct 


tctggcaata 


2640 


ggacatttac 


gcgggggggg 


ggggtggttc 


ttgattctgg 


gtcttttagg 


ggactctgtg 


2700 


attaagaaac 


agcagggatg 


ttgcaacagc 


agggatgagg 


tgggcctggg 


gacgggtcag 


2760 


tgaagggtct 


tcattcctag 


ctgctgacct 


gatctgccct 


gagataaaag 


actaagaccc 


2820 


agagagtgaa 


cgctgtccgc 


gggggcagaa 


gcgagtgagg 


cgtcgggaca 


gtggggcata 


2880 


accaagagca 


aaacgcaaac 


tgagacttca 


gcgccggttt 


ctcgggccag 


cccacgcctc 


2940 



ctgcctcagc 


tcaatgccac 


tccctccccg 


cgagttctcc 


ggtggcccct 


ggaggctccg 


tgaggggagg 


ggcgctgact 


gggccgtcca 


agcgcctggc 


ttgctgcgct 


gcgagtggct 


gctagttctc 


ggctgcaaat 


cttcgtcctt 


agggcgcgct 


ttgcttggaa 


aggcacaggt 


ccgccctggg 


aggagtctcc 


ctcccttggc 


tagcgttggc 


ggttccagag 


tgcgggctgc 


ggcccagccc 


cttcccacag 


cgcggcggtg 



<210> 26 
<211> 3464 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> promoter 
<222> 



<400> 26 






ttgcacagct 


aagatctggt 


ggaggcatgc 


tcactgtact 


atgttccctt 


ccataggcct 


ttatcttagg 


tattatctag 


accacagatt 


gagctaggat 


aaccccttct 


cttttgacag 


gtggagtgcc 


aatcctgagt 


atcacctcta 


aattccctgg 


caaaagtgat 


tggatggaac 


ttaaatacga 


gaatgcacat 


ataactcttg 


atacatgcta 


atccctttga 


gcaaatccac 


tagctgggct 


gtgattgggg 


cagggggagc 



ccaagtggct 


ctccgctctg gaggcgggac 


3000 


gcagcgagct 


ctgggaggct gggaggggag 


3060 


aagaggaggg 


ggcctttaat aggctcgccc 


3120 


gcggttgcga 


gaagccgccc ggcaccttcc 


3180 


gcacttgaca 


gcgattgtac ttaagctccc 


3240 


aggaagcgcg 


ggctgccggg tgcacgctcg 


3300 


tctcctttct 


gggaactgcc ggctgtcccg 


3360 


acggagaccg 


cggcagcggc cggagagccc 


3420 


cgctgcccgg 


cgccatg 


3467 


acacagggcc 


ctctgaccat gggctctaaa 


60 


caatcagtca 


tgtaatattt gacctggtcg 


120 


ttggatgcag 


ttctctggct, gaagacctct 


180 


acgagtcaga 


gaatcagatc agtgatagaa 


240 


ctcaagtgct 


caacatatcc ctagatcctc 


300 


cacaggcttc 


caagagggga cagtcaagca 


360 


gtgcaatgtt 


tagcacatac taagcctgca 


420 


atggccagtt 


tctgtgctca ggggtgagaa 


480 


actaagtggg 


agggacttcc tgtctcaggt 


540 



ccctgccatc ttgactgaca tgctgcagcc cttgccaaaa cccatgggtc agaatgaaag 600 
taaagtgccg ttgaaaacct tgcaatccac ctttaaaact gccgggtgta gtaaaacaat 660 
tgcttgcccc aaataaatga cttatcattg ctgttggttg tctgcgtttc tctttaatta 720 
taggccctct ttgaacgctc aaacacacag ggcctttgta agcttgaact ccctgtctca 780 
cacacagtcc tcccataccc atacactctc tttcatttgc agagtataaa cacccatctc 840 
tcactcattc acataatgaa tttcagctcc ttgtgtccca atcaaggaga ggcctcactg 900 
gaattatggg catctgagcc atcttcatgt tccaaggccc cagggggcgc ttccaagagt 960 
ggatccttta tggggagaag ataatgggca aaaagtgctc ttcactgatg gaccagtccc 1020 
agccttttct ctccttggac aatagagttc ttcccttgaa cagccacttc cctaaaaaaa 1080 
attccaaaat tctcccacat catccccttt atgcttaaaa tcatcacaca ctcccttctt 1140 
tgtcctcccc tcttgcaaac tcaactcaga gccctttggc tccagaaaga ttttctaggt 1200 
atcaggagag agtagcaaag cctccctcct ctccttgcct ttctcccttg tcagagaaag 1260 
aagttgattc tgcggagagg taagaaggat cttgaggtct agagcctgaa aaactccttg 1320 
ggctgttctc caaactagat gggaacataa ggtgcgattg catcttctcc agctgatact 1380 
cactcggcct cctatgccag tccccagtcc agggtttggt caagggtcaa atgagataat 1440 
ttcatggagg aagcctggcc cgatttttct actgtttgct ggaagacagc ctcttcctct 1500 
tgtaactgca gccccagaac ctgatctcca catccctgcc aggcaggtag ctgtgtacaa 1560 
gggctcatct tcctgccccc aaccccagct ctgatttgct tattcaggtg gtgtaaatac 1620 
ttctaccagg acctatttca agccattgtg atgtccctga ctggggagat gcagggcagc 1680 
acaccattta atatttccct cacatttcca ccccattctg cactcttttc tgggagttgc 1740 
tgtctcagag ggttggcggt tctggtggct caagaccata agtaattatc aaatacttag 1800 
gaagcgacgg gttttgagta tttattacct tttaaaaatg tactttgtgg ctaggcatgg 1860 
tggctcacgc ctgtagtccc cgcaccggga ggccgaggtg ggtggattgc ttgagctcag 1920 
gagttcaaga ccagcctggg caacacggcg aaacccagtc tctaccaaaa atacacacac 1980 
acacacacac acacacacac acacacacac acacacacaa attggcctag cgtggtgtcg 2040 
tgtgtctgtg gtcgcagtta ctcaggagac caaggtagga ggtaggaaac caaggtagga 2100 
ggatcacccg aggtcggtag ttcgagacca gcctgaccaa catggagaaa ccctgtcttt 2160 
actaaaaata caaaattagc tgggcgtggt ggtgcatgcc tgtaattcca gctacttggg 2220 
aggctgagac aggagaatgg cttgaacccg gaaggcggag tttgcggtga gctgagatcg 2280 



cgtcattgca ctccagcctg ggcaacaaga gcaaaactcc gtctcaaaaa aaaagaaata 2340 
tatatatata tatgtgtgtg tgtgtgtgtg tgtgtgtatg tatatatata tatgtatgtg 2400 
tatatatatg tatgtgtgtg tgtgtgcata tatatataca ctttgtttaa ttgtaagtgt 2460 
gtttagttta atttttaata atgtccgtga ttaacagctg gctggcaaga ttcctgagaa 2520 
ctgaagagtt tgccccagcc catccagcac accatgggcc cagggcagac cttggggcta 2580 
ggcggtcttg ggttccagag ggctcccatg cccctgtcct attgctcttc tggcaatagg 2640 
acatttacgc gggggggggg gtggttcttg attctgggtc ttttagggga ctctgtgatt 2700 
aagaaacagc agggatgttg caacagcagg gatgaggtgg gcctggggac gggtcagtga 2760 
agggtcttca ttcctagctg ctgacctgat ctgccctgag ataaaagact aagacccaga 2820 
gagtgaacgc tgtccgcggg ggcagaagcg agtgaggcgt cgggacagtg gggcataacc 2880 
aagagcaaaa cgcaaactga gacttcagcg ccggtttctc gggccagccc acgcctcctg 2940 
cctcagctca atgccactcc ctccccgcca agtggctctc cgctctggag gcgggaccga 3000 
gttctccggt ggcccctgga ggctccggca gcgagctctg ggaggctggg aggggagtga 3060 
ggggaggggc gctgactggg ccgtccaaag aggagggggc ctttaatagg ctcgcccagc 3120 
gcctggcttg ctgcgctgcg agtggctgcg gttgcgagaa gccgcccggc accttccgct 3180 
agttctcggc tgcaaatctt cgtccttgca cttgacagcg attgtactta agctcccagg 3240 
gcgcgctttg cttggaaagg cacaggtagg aagcgcgggc tgccgggtgc acgctcgccg 3300 
ccctgggagg agtctccctc ccttggctct cctttctggg aactgccggc tgtcccgtag 3360 
cgttggcggt tccagagtgc gggctgcacg gagaccgcgg cagcggccgg agagcccggc 3420 
ccagcccctt cccacagcgc ggcggtgcgc tgcccggcgc catg 3464 

<210> 27 

<211> 3469 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> promoter 
<222> 



<400> 27 

ttgcacagct aagatctggt ggaggcatgc 
tcactgtact atgttccctt ccataggcct 
ttatcttagg tattatctag accacagatt 
gagctaggat aaccccttct cttttgacag 
gtggagtgcc aatcctgagt atcacctcta 
aattccctgg caaaagtgat tggatggaac 
ttaaatacga gaatgcacat ataactcttg 

* atacatgcta atccctttga gcaaatccac 
tagctgggct gtgattgggg cagggggagc 
ccctgccatc ttgactgaca tgctgcagcc 
taaagtgccg ttgaaaacct tgcaatccac 
tgcttgcccc aaataaatga cttatcattg 
taggccctct ttgaacgctc aaacacacag 
cacacagtcc tcccataccc atacactctc 
tcactcattc acataatgaa tttcagctcc 
gaattatggg catctgagcc atcttcatgt 

j ggatccttta tggggagaag ataatgggca 
agccttttct ctccttggac aatagagttc 
attccaaaat tctcccacat catccccttt 
tgtcctcccc tcttgcaaac tcaactcaga 
atcaggagag agtagcaaag cctccctcct 
aagttgattc tgcggagagg taagaaggat 
ggctgttctc caaactagat gggaacataa 
cactcggcct cctatgccag tccccagtcc 
ttcatggagg aagcctggcc cgatttttct 
tgtaactgca gccccagaac ctgatctcca 
gggctcatct tcctgccccc aaccccagct 



acacagggcc 


ctctgaccat 


gggctctaaa 


60 


caatcagtca 


tgtaatattt 


gacctggtcg 


120 


ttggatgcag 


ttctctggct 


gaagacctct 


180 


acgagtcaga 


gaatcagatc 


agtgatagaa 


240 


ctcaagtgct 


caacatatcc 


ctagatcctc 


300 


cacaggcttc 


caagagggga 


cagtcaagca 


360 


gtgcaatgtt 


tagcacatac 


taagcctgca 


420 


atggccagtt 


tctgtgctca 


ggggtgagaa 


480 


actaagtggg 


agggacttcc 


tgtctcaggt 


540 


cttgccaaaa 


cccatgggtc 


agaatgaaag 


600 


ctttaaaact 


gccgggtgta 


gtaaaacaat 


660 


ctgttggttg 


tctgcgtttc 


tctttaatta 


720 


ggcctttgta 


agcttgaact 


ccctgtctca 


780 


tttcatttgc 


agagtataaa 


cacccatctc 


840 


ttgtgtccca 


atcaaggaga 


ggcctcactg 


900 


tccaaggccc 


cagggggcgc 


ttccaagagt 


960 


aaaagtgctc 


ttcactgatg 


gaccagtccc 


1020 


ttcccttgaa 


cagccacttc 


cctaaaaaaa 


1080 


atgcttaaaa 


tcatcacaca 


ctcccttctt 


1140 


gccctttggc 


tccagaaaga 


ttttctaggt 


1200 


ctccttgcct 


ttctcccttg 


tcagagaaag 


1260 


cttgaggtct 


agagcctgaa 


aaactccttg 


1320 


ggtgcgattg 


catcttctcc 


agctgatact 


1380 


agggtttggt 


caagggtcaa 


atgagataat 


1440 


actgtttgct 


ggaagacagc 


ctcttcctct 


1500 


catccctgcc 


aggcaggtag 


ctgtgtacaa 


1560 


ctgatttgct 


tattcaggtg 


gtgtaaatac 


1620 



ttctaccagg 


acctatttca 


agccattgtg 


atgtccctga 


ctggggagat gcagggcagc 


1680 


acaccattta 


atatttccct 


cacatttcca 


ccccattctg 


cactcttttc tgggagttgc 


1740 


tgtctcagag 


ggttggcggt 


tctggtggct 


caagaccata 


agtaattatc aaatacttag 


1800 


gaagcgacgg 


gttttgagta 


tttattacct 


tttaaaaatg 


tactttgtgg ctaggcatgg 


1860 


tggctcacgc 


ctgtagtccc 


cgcaccggga 


ggccgaggtg 


ggtggattgc ttgagctcag 


1920 


gagttcaaga 


ccagcctggg 


caacacggcg 


aaacccagtc 


tctaccaaaa atacacacac 


1980 


acacacacac 


acacacacac 


acacacacac 


acacacacac 


acaaattggc ctagcgtggt 


2040 


gtcgtgtgtc 


tgtggtcgca 


gttactcagg 


agaccaaggt 


aggaggtagg aaaccaaggt 


2100 


aggaggatca 


cccgaggtcg 


gtagttcgag 


accagcctga 


ccaacatgga gaaaccctgt 


2160 


ctttactaaa 


aatacaaaat 


tagctgggcg 


tggtggtgca 


tgcctgtaat tccagctact 


2220 


tgggaggctg 


agacaggaga 


atggcttgaa 


cccggaaggc 


ggagtttgcg gtgagctgag 


2280 


atcgcgtcat 


tgcactccag 


cctgggcaac 


aagagcaaaa 


ctccgtctca aaaaaaaaga 


2340 


aatatatata 


tatatgtgtg 


tgtgtgtgtg 


tgtgtgtgta 


tgtatatata tatatgtatg 


2400 


tgtatatata 


tgtatgtgtg 


tgtgtgtgca 


tatatatata 


tacactttgt ttaattgtaa 


2460 


gtgtgtttag 


tttaattttt 


aataatgtcc 


gtgattaaca 


gctggctggc aagattcctg 


2520 


agaactgaag 


agtttgcccc 


agcccatcca 


gcacaccatg 


ggcccagggc agaccttggg 


2580 


gctaggcggt 


cttgggttcc 


agagggctcc 


catgcccctg 


tcctattgct cttctggcaa 


2640 


taggacattt 


acgcgggggg 


ggggggtggt 


tcttgattct 


gggtctttta ggggactctg 


2700 


tgattaagaa 


acagcaggga 


tgttgcaaca 


gcagggatga 


ggtgggcctg gggacgggtc 


2760 


agtgaagggt 


cttcattcct 


agctgctgac 


ctgatctgcc 


ctgagataaa agactaagac 


2820 


ccagagagtg 


aacgctgtcc 


gcgggggcag 


aagcgagtga 


ggcgtcggga cagtggggca 


2880 


taaccaagag 


caaaacgcaa 


actgagactt 


cagcgccggt 


ttctcgggcc agcccacgcc 


2940 


tcctgcctca 


gctcaatgcc 


actccctccc 


cgccaagtgg 


ctctccgctc tggaggcggg 


3000 


accgagttct 


ccggtggccc 


ctggaggctc 


cggcagcgag 


ctctgggagg ctgggagggg 


3060 


agtgagggga 


ggggcgctga 


ctgggccgtc 


caaagaggag 


ggggccttta ataggctcgc 


3120 


ccagcgcctg 


gcttgctgcg 


ctgcgagtgg 


ctgcggttgc 


gagaagccgc ccggcacctt 


3180 


ccgctagttc 


tcggctgcaa 


atcttcgtcc 


ttgcacttga 


cagcgattgt acttaagctc 


3240 


ccagggcgcg 


ctttgcttgg 


aaaggcacag 


gtaggaagcg 


cgggctgccg ggtgcacgct 


3300 


cgccgccctg 


ggaggagtct 


ccctcccttg 


gctctccttt 


ctgggaactg ccggctgtcc 


3360 



cgtagcgttg gcggttccag agtgcgggct gcacggagac cgcggcagcg gccggagagc 3420 
ccggcccagc cccttcccac agcgcggcgg tgcgctgccc ggcgccatg 3469 



<210> 28 
<211> 3470 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> promoter 
<222> 



<400> 28 






ttgcacagct 


aagatctggt 


ggaggcatgc 


tcactgtact 


atgttccctt 


ccataggcct 


ttatcttagg 


tattatctag 


accacagatt 


gagctaggat 


accccttctc 


ttttgacaga 


tggagtgcca 


atcctgagta 


tcacctctac 


attccctggc 


aaaagtgatt 


ggatggaacc 


taaatacgag 


aatgcacata 


taactcttgg 


tacatgctaa 


tccctttgag 


caaatccaca 


agctgggctg 


tgattggggc 


agggggagca 


cctgccatct 


tgactgacat 


gctgcagccc 


aaagtgccgt 


tgaaaacctt 


gcaatccacc 


gcttgcccca 


aataaatgac 


ttatcattgc 


aggccctctt 


tgaacgctca 


aacacacagg 


acacagtcct 


cccataccca 


tacactctct 


cactcattca 


cataatgaat 


ttcagctcct 


aattatgggc 


atctgagcca 


tcttcatgtt 



acacagggcc ctctgaccat gggctctaaa 60 
caatcagtca tgtaatattt gacctggtcg 120 
ttggatgcag ttctctggct gaagacctct 180 
cgagtcagag aatcagatca gtgatagaag 240 
tcaagtgctc aacatatccc tagatcctca 300 
acaggcttcc aagaggggac agtcaagcat 360 
tgcaatgttt agcacatact aagcctgcaa 420 
tggccagttt ctgtgctcag gggtgagaat 480 
ctaagtggga gggacttcct gtctcaggtc 540 
ttgccaaaac ccatgggtca gaatgaaagt 600 
tttaaaactg ccgggtgtag taaaacaatt 660 
tgttggttgt ctgcgtttct ctttaattat 720 
gcctttgtaa gcttgaactc cctgtctcac 780 
ttcatttgca gagtataaac acccatctct 840 
tgtgtcccaa tcaaggagag gcctcactgg 900 
ccaaggcccc agggggcgct tccaagagtg 960 



gatcctttat 


ggggagaaga 


taatgggcaa 


aaagtgctct 


tcactgatgg 


accagtccca 


1020 


gccttttctc 


tccttggaca 


atagagttct 


tcccttgaac 


agccacttcc 


ctaaaaaaaa 


1080 


ttccaaaatt 


ctcccacatc 


atccccttta 


tgcttaaaat 


catcacacac 


tcccttcttt 


1140 


gtcctcccct 


cttgcaaact 


caactcagag 


ccctttggct 


ccagaaagat 


tttctaggta 


1200 


tcaggagaga 


gtagcaaagc 


ctccctcctc 


tccttgcctt 


tctcccttgt 


cagagaaaga 


1260 


agttgattct 


gcggagaggt 


aagaaggatc 


ttgaggtcta 


gagcctgaaa 


aactccttgg 


1320 


gctgttctcc 


aaactagatg 


ggaacataag 


gtgcgattgc 


atcttctcca 


gctgatactc 


1380 


actcggcctc 


ctatgccagt 


ccccagtcca 


gggtttggtc 


aagggtcaaa 


tgagataatt 


1440 


tcatggagga 


agcctggccc 


gatttttcta 


ctgtttgctg 


gaagacagcc 


tcttcctctt 


1500 


gtaactgcag 


ccccagaacc 


tgatctccac 


atccctgcca 


ggcaggtagc 


tgtgtacaag 


1560 


ggctcatctt 


cctgccccca 


accccagctc 


tgatttgctt 


attcaggtgg 


tgtaaatact 


1620 


tctaccagga 


cctatttcaa 


gccattgtga 


tgtccctgac 


tggggagatg 


cagggcagca 


1680 


caccatttaa 


tatttccctc 


acatttccac 


cccattctgc 


actcttttct 


gggagttgct 


1740 


gtctcagagg 


gttggcggtt 


ctggtggctc 


aagaccataa 


gtaattatca 


aatacttagg 


1800 


aagcgacggg 


ttttgagtat 


ttattacctt 


ttaaaaatgt 


actttgtggc 


taggcatggt 


1860 


ggctcacgcc 


tgtagtcccc 


gcaccgggag 


gccgaggtgg 


gtggattgct 


tgagctcagg 


1920 


agttcaagac 


cagcctgggc 


aacacggcga 


aacccagtct 


ctaccaaaaa 


tacacacaca 


1980 


cacacacaca 


cacacacaca 


cacacacaca 


cacacacaca 


caaattggcc 


tagcgtggtg 


2040 


tcgtgtgtct 


gtggtcgcag 


ttactcagga 


gaccaaggta 


ggaggtagga 


aaccaaggta 


2100 


ggaggatcac 


ccgaggtcgg 


tagttcgaga 


ccagcctgac 


caacatggag 


aaaccctgtc 


2160 


tttactaaaa 


atacaaaatt 


agctgggcgt 


ggtggtgcat 


gcctgtaatt 


ccagctactt 


2220 


gggaggctga 


gacaggagaa 


tggcttgaac 


ccggaaggcg 


gagtttgcgg 


tgagctgaga 


2280 


tcgcgtcatt 


gcactccagc 


ctgggcaaca 


agagcaaaac 


tccgtctcaa 


aaaaaaagaa 


2340 


atatatatat 


atatgtgtgt 


gtgtgtgtgt 


gtgtgtgtat 


gtatatatat 


atatgtatgt 


2400 


gtatatatat 


gtatgtgtgt 


gtgtgtgcat 


atatatatat 


acactttgtt 


taattgtaag 


2460 




ttaattttta 


a log Lg i/tug 
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2520 


gaactgaaga 


gtttgcccca 


gcccatccag 


cacaccatgg 


gcccagggca 


gaccttgggg 


2580 


ctaggcggtc 


ttgggttcca 


gagggctccc 


atgcccctgt 


cctattgctc 


ttctggcaat 


2640 


aggacattta 


cgcggggggg 


gggggggtgg 


ttcttgattc 


tgggtctttt 


aggggactct 


2700 



gtgattaaga 


aacagcaggg 


atgttgcaac 


cagtgaaggg 


tcttcattcc 


tagctgctga 


cccagagagt 


gaacgctgtc 


cgcgggggca 


ataaccaaga 


gcaaaacgca 


aactgagact 


ctcctgcctc 


agctcaatgc 


cactccctcc 


gaccgagttc 


tccggtggcc 


cctggaggct 


gagtgagggg 


aggggcgctg 


actgggccgt 


cccagcgcct 


ggcttgctgc 


gctgcgagtg 


tccgctagtt 


ctcggctgca 


aatcttcgtc 


cccagggcgc 


gctttgcttg 


gaaaggcaca 


tcgccgccct 


gggaggagtc 


tccctccctt 


ccgtagcgtt 


ggcggttcca 


gagtgcgggc 


cccggcccag 


ccccttccca 


cagcgcggcg 



<210> 29 

<211> 3467 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> promoter 
<222> 

<400> 29 

ttgcacagct aagatctggt ggaggcatgc 
tcactgtact atgttccctt ccataggcct 
ttatcttagg tattatctag accacagatt 
gagctaggat aaccccttct cttttgacag 
gtggagtgcc aatcctgagt atcacctcta 



agcagggatg 


aggtgggcct 


ggggacgggt 


2760 


cctgatctgc 


cctgagataa 


aagactaaga 


2820 


gaagcgagtg 


aggcgtcggg 


acagtggggc 


2880 


tcagcgccgg 


tttctcgggc 


cagcccacgc 


2940 


ccgccaagtg 


gctctccgct 


ctggaggcgg 


3000 


ccggcagcga 


gctctgggag 


gctgggaggg 


3060 


ccaaagagga 


gggggccttt 


aataggctcg 


3120 


gctgcggttg 


cgagaagccg 


cccggcacct 


3180 


cttgcacttg 


acagcgattg 


tacttaagct 


3240 


ggtaggaagc 


gcgggctgcc 


gggtgcacgc 


3300 


ggctctcctt 


tctgggaact 


gccggctgtc 


3360 


tgcacggaga 


ccgcggcagc 


ggccggagag 


3420 


gtgcgctgcc 


cggcgccatg 




3470 


acacagggcc 


ctctgaccat 


gggctctaaa 


60 


caatcagtca 


tgtaatattt 


gacctggtcg 


120 


ttggatgcag 


ttctctggct 


gaagacctct 


180 


acgagtcaga 


gaatcagatc 


agtgatagaa 


240 


ctcaagtgct 


caacatatcc 


ctagatcctc 


300 



aattccctgg 


caaaagtgat 


tggatggaac 


cacaggcttc 


caagagggga 


cagtcaagca 


360 


ttaaatacga 


gaatgcacat 


ataactcttg 


gtgcaatgtt 


tagcacatac 


taagcctgca 


420 


atacatgcta 


atccctttga 


gcaaatccac 


atggccagtt 


tctgtgctca 


ggggtgagaa 


480 


tagctgggct 


gtgattgggg 


cagggggagc 


actaagtggg 


agggacttcc 


tgtctcaggt 


540 


ccctgccatc 


ttgactgaca 


tgctgcagcc 


cttgccaaaa 


cccatgggtc 


agaatgaaag 


600 


taaagtgccg 


ttgaaaacct 


tgcaatccac 


ctttaaaact 


gccgggtgta 


gtaaaacaat 


660 


tgcttgcccc 


aaataaatga 


cttatcattg 


ctgttggttg 


tctgcgtttc 


tctttaatta 


720 


taggccctct 


ttgaacgctc 


aaacacacag 


ggcctttgta 


agcttgaact 


ccctgtctca 


780 


cacacagtcc 


tcccataccc 


atacactctc 


tttcatttgc 


agagtataaa 


cacccatctc 


840 


tcactcattc 


acataatgaa 


tttcagctcc 


ttgtgtccca 


atcaaggaga 


ggcctcactg 


900 


gaattatggg 


catctgagcc 


atcttcatgt 


tccaaggccc 


cagggggcgc 


ttccaagagt 


960 


ggatccttta 


tggggagaag 


ataatgggca 


aaaagtgctc 


ttcactgatg 


gaccagtccc 


1020 


agccttttct 


ctccttggac 


aatagagttc 


ttcccttgaa 


cagccacttc 


cctaaaaaaa 


1080 


attccaaaat 


tctcccacat 


catccccttt 


atgcttaaaa 


tcatcacaca 


ctcccttctt 


1140 


tgtcctcccc 


tcttgcaaac 


tcaactcaga 


gccctttggc 


tccagaaaga 


ttttctaggt 


1200 


atcaggagag 


agtagcaaag 


cctccctcct 


ctccttgcct 


t.tctcccttg 


tcagagaaag 


1260 


aagttgattc 


tgcggagagg 


taagaaggat 


cttgaggtct 


agagcctgaa 


aaactccttg 


1320 


ggctgttctc 


caaactagat 


gggaacataa 


ggtgcgattg 


catcttctcc 


agctgatact 


1380 


cactcggcct 


cctatgccag 


tccccagtcc 


agggtttggt 


caagggtcaa 


atgagataat 


1440 


ttcatggagg 


aagcctggcc 


cgatttttct 


actgtttgct 


ggaagacagc 


ctcttcctct 


1500 


tgtaactgca 


gccccagaac 


ctgatctcca 


catccctgcc 


aggcaggtag 


ctgtgtacaa 


1560 


gggctcatct 


tcctgccccc 


aaccccagct 


ctgatttgct 


tattcaggtg 


gtgtaaatac 


1620 


ttctaccagg 


acctatttca 


agccattgtg 


atgtccctga 


ctggggagat 


gcagggcagc 


1680 


acaccattta 


atatttccct 


cacatttcca 


ccccattctg 


cactcttttc 


tgggagttgc 


1740 


tgtctcagag 


ggttggcggt 


tctggtggct 


caagaccata 


agtaattatc 


aaatacttag 


1800 


gaagcgacgg 


gttttgagta 


tttattacct 


tttaaaaatg 


tactttgtgg 


ctaggcatgg 


1860 


tggctcacgc 


ctgtagtccc 


cgcaccggga 


ggccgaggtg 


ggtggattgc 


ttgagctcag 


1920 


gagttcaaga 


ccagcctggg 


caacacggcg 


aaacccagtc 


tctaccaaaa 


atacacacac 


1980 


acacacacac 


acacacacac 


acacacacac 


acacacacac 


acaaattggc 


ctagcgtggt 


2040 



gtcgtgtgtc tgtggtcgca gttactcagg agaccaaggt aggaggtagg aaaccaaggt 2100 
aggaggatca cccgaggtcg gtagttcgag accagcctga ccaacatgga gaaaccctgt 2160 
ctttactaaa aatacaaaat tagctgggcg tggtggtgca tgcctgtaat tccagctact 2220 
tgggaggctg agacaggaga atggcttgaa cccggaaggc ggagtttgcg gtgagctgag 2280 
atcgcgtcat tgcactccag cctgggcaac aagagcaaaa ctccgtctca aaaaaaaaga 2340 
aatatatata tatatgtgtg tgtgtgtgtg tgtgtgtgtg tatgtatata tatatatgta 2400 
tgtgtatata tatgtatgtg tgtgtgtgtg catatatata tacactttgt ttaattgtaa 2460 
gtgtgtttag tttaattttt aataatgtcc gtgattaaca gctggctggc aagattcctg 2520 
agaactgaag agtttgcccc agcccatcca gcacaccatg ggcccagggc agaccttggg 2580 
gctaggcggt cttgggttcc agagggctcc catgcccctg tcctattgct cttctggcaa 2640 
taggacattt acgcgggggg ggggtggttc ttgattctgg gtcttttagg ggactctgtg 2700 
attaagaaac agcagggatg ttgcaacagc agggatgagg tgggcctggg gacgggtcag 2760 
tgaagggtct tcattcctag ctgctgacct gatctgccct gagataaaag actaagaccc 2820 
agagagtgaa cgctgtccgc gggggcagaa gcgagtgagg cgtcgggaca gtggggcata 2880 
accaagagca aaacgcaaac tgagacttca gcgccggttt ctcgggccag cccacgcctc 2940 
ctgcctcagc tcaatgccac tccctccccg ccaagtggct ctccgctctg gaggcgggac 3000 
cgagttctcc ggtggcccct ggaggctccg gcagcgagct ctgggaggct gggaggggag 3060 
tgaggggagg ggcgctgact gggccgtcca aagaggaggg ggcctttaat aggctcgccc 3120 
agcgcctggc ttgctgcgct gcgagtggct gcggttgcga gaagccgccc ggcaccttcc 3180 
gctagttctc ggctgcaaat cttcgtcctt gcacttgaca gcgattgtac ttaagctccc 3240 
agggcgcgct ttgcttggaa aggcacaggt aggaagcgcg ggctgccggg tgcacgctcg 3300 
ccgccctggg aggagtctcc ctcccttggc tctcctttct gggaactgcc ggctgtcccg 3360 
tagcgttggc ggttccagag tgcgggctgc acggagaccg cggcagcggc cggagagccc 3420 
ggcccagccc cttcccacag cgcggcggtg cgctgcccgg cgccatg 3467 

<210> 30 

<211> 3462 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> promoter 
<222> 



<400> 30 

ttgcacagct aagatctggt ggaggcatgc 
tcactgtact atgttccctt ccataggcct 
ttatcttagg tattatctag accacagatt 
gagctaggat aaccccttct cttttgacag 
gtggagtgcc aatcctgagt atcacctcta 
aattccctgg caaaagtgat tggatggaac 
ttaaatacga gaatgcacat ataactcttg 
atacatgcta atccctttga gcaaatccac 
tagctgggct gtgattgggg cagggggagc 
ccctgccatc ttgactgaca tgctgcagcc 
taaagtgccg ttgaaaacct tgcaatccac 
tgcttgcccc aaataaatga cttatcattg 
taggccctct ttgaacgctc aaacacacag 
cacacagtcc tcccataccc atacactctc 
tcactcattc acataatgaa tttcagctcc 
gaattatggg catctgagcc atcttcatgt 
ggatccttta tggggagaag ataatgggca 
agccttttct ctccttggac aatagagttc 
attccaaaat tctcccacat catccccttt 
tgtcctcccc tcttgcaaac tcaactcaga 
atcaggagag agtagcaaag cctccctcct 
aagttgattc tgcggagagg taagaaggat 
ggctgttctc caaactagat gggaacataa 



acacagggcc ctctgaccat gggctctaaa 60 
caatcagtca tgtaatattt gacctggtcg 120 
ttggatgcag ttctctggct gaagacctct 180 
acgagtcaga gaatcagatc agtgatagaa 240 
ctcaagtgct caacatatcc ctagatcctc 300 
cacaggcttc caagagggga cagtcaagca 360 
gtgcaatgtt tagcacatac taagcctgca 420 
atggccagtt tctgtgctca ggggtgagaa 480 
actaagtggg agggacttcc tgtctcaggt 540 
cttgccaaaa cccatgggtc agaatgaaag 600 
ctttaaaact gccgggtgta gtaaaacaat 660 
ctgttggttg tctgcgtttc tctttaatta 720 
ggcctttgta agcttgaact ccctgtctca 780 
tttcatttgc agagtataaa cacccatctc 840 
ttgtgtccca atcaaggaga ggcctcactg 900 
tccaaggccc cagggggcgc ttccaagagt 960 
aaaagtgctc ttcactgatg gaccagtccc 1020 
ttcccttgaa cagccacttc cctaaaaaaa 1080 
atgcttaaaa tcatcacaca ctcccttctt 1140 
gccctttggc tccagaaaga ttttctaggt 1200 
ctccttgcct ttctcccttg tcagagaaag 1260 
cttgaggtct agagcctgaa aaactccttg 1320 
ggtgcgattg catcttctcc agctgatact 1380 



cactcggcct 


cctatgccag 


tccccagtcc 


ttcatggagg 


aagcctggcc 


cgatttttct 


tgtaactgca 


gccccagaac 


ctgatctcca 


gggctcatct 


tcctgccccc 


aaccccagct 


ttctaccagg 


acctatttca 


agccattgtg 


acaccattta 


atatttccct 


cacatttcca 


tgtctcagag 


ggttggcggt 


tctggtggct 


gaagcgacgg 


gttttgagta 


tttattacct 


tggctcacgc 


ctgtagtccc 


cgcaccggga 


gagttcaaga 


ccagcctggg 


caacacggcg 


acacacacac 


acacacacac 


acacacacac 


tgtgtctgtg 


gtcgcagtta 


ctcaggagac 


ggatcacccg 


aggtcggtag 


ttcgagacca 


actaaaaata 


caaaattagc 


tgggcgtggt 


aggctgagac 


aggagaatgg 


cttgaacccg 


cgtcattgca 


ctccagcctg 


ggcaacaaga 


tatatatata 


tgtgtgtgtg 


tgtgtgtgtg 


tatatatgta 


tgtgtgtgtg 


tgtgcatata 


gtttagttta 


atttttaata 


atgtccgtga 


ctgaagagtt 


tgccccagcc 


catccagcac 


ggcggtcttg 


ggttccagag 


ggctcccatg 


acatttacgc 


gggggggggt 


ggttcttgat 


gaaacagcag 


ggatgttgca 


acagcaggga 


ggtcttcatt 


cctagctgct 


gacctgatct 


gtgaacgctg 


tccgcggggg 


cagaagcgag 


gagcaaaacg 


caaactgaga 


cttcagcgcc 


tcagctcaat 


gccactccct 


ccccgccaag 


tctccggtgg 


cccctggagg 


ctccggcagc 


ggaggggcgc 


tgactgggcc 


gtccaaagag 



agggtttggt caagggtcaa atgagataat 1440 
actgtttgct ggaagacagc ctcttcctct 1500 
catccctgcc aggcaggtag ctgtgtacaa 1560 
ctgatttgct tattcaggtg gtgtaaatac 1620 
atgtccctga ctggggagat gcagggcagc 1680 
ccccattctg cactcttttc tgggagttgc 1740 
caagaccata agtaattatc aaatacttag 1800 
tttaaaaatg tactttgtgg ctaggcatgg 1860 
ggccgaggtg ggtggattgc ttgagctcag 1920 
aaacccagtc tctaccaaaa atacacacac 1980 
acacacacaa attggcctag cgtggtgtcg 2040 
caaggtagga ggtaggaaac caaggtagga 2100 
gcctgaccaa catggagaaa ccctgtcttt 2160 
ggtgcatgcc tgtaattcca gctacttggg 2220 
gaaggcggag tttgcggtga gctgagatcg 2280 
gcaaaactcc gtctcaaaaa aaaagaaata 2340 
tgtgtatgta tatatatata tgtatgtgta 2400 
tatatataca ctttgtttaa ttgtaagtgt 2460 
ttaacagctg gctggcaaga ttcctgagaa 2520 
accatgggcc cagggcagac cttggggcta 2580 
cccctgtcct attgctcttc tggcaatagg 2640 
tctgggtctt ttaggggact ctgtgattaa 2700 
tgaggtgggc ctggggacgg gtcagtgaag 2760 
gccctgagat aaaagactaa gacccagaga 2820 
tgaggcgtcg ggacagtggg gcataaccaa 2880 
ggtttctcgg gccagcccac gcctcctgcc 2940 
tggctctccg ctctggaggc gggaccgagt 3000 
gagctctggg aggctgggag gggagtgagg 3060 
gagggggcct ttaataggct cgcccagcgc 3120 



ctggcttgct 


gcgctgcgag 


tggctgcggt 


ttctcggctg 


caaatcttcg 


tccttgcact 


gcgctttgct 


tggaaaggca 


caggtaggaa 


ctgggaggag 


tctccctccc 


ttggctctcc 


ttggcggttc 


cagagtgcgg 


gctgcacgga 


agccccttcc 


cacagcgcgg 


cggtgcgctg 



<210> 31 
<211> 3455 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> promoter 
<222> 



<400> 31 






ttgcacagct 


aagatctggt 


ggaggcatgc 


tcactgtact 


atgttccctt 


ccataggcct 


ttatcttagg 


tattatctag 


accacagatt 


gagctaggat 


aaccccttct 


cttttgacag 


gtggagtgcc 


aatcctgagt 


atcacctcta 


aattccctgg 


caaaagtgat 


tggatggaac 


ttaaatacga 


gaatgcacat 


ataactcttg 


atacatgcta 


atccctttga 


gcaaatccac 


tagctgggct 


gtgattgggg 


cagggggagc 


ccctgccatc 


ttgactgaca 


tgctgcagcc 


taaagtgccg 


ttgaaaacct 


tgcaatccac 


tgcttgcccc 


aaataaatga 


cttatcattg 



tgcgagaagc 


cgcccggcac cttccgctag 


3180 


tgacagcgat 


tgtacttaag ctcccagggc 


3240 


gcgcgggctg 


ccgggtgcac gctcgccgcc 


3300 


tttctgggaa 


ctgccggctg tcccgtagcg 


3360 


gaccgcggca 


gcggccggag agcccggccc 


3420 


cccggcgcca 


tg 


3462 


acacagggcc 


ctctgaccat gggctctaaa 


60 


caatcagtca 


tgtaatattt gacctggtcg 


120 


ttggatgcag 


ttctctggct gaagacctct 


180 


acgagtcaga 


gaatcagatc agtgatagaa 


240 


ctcaagtgct 


caacatatcc ctagatcctc 


300 


cacaggcttc 


caagagggga cagtcaagca 


360 


gtgcaatgtt 


tagcacatac taagcctgca 


420 


atggccagtt 


tctgtgctca ggggtgagaa 


480 


actaagtggg 


agggacttcc tgtctcaggt 


540 


cttgccaaaa 


cccatgggtc agaatgaaag 


600 


ctttaaaact 


gccgggtgta gtaaaacaat 


660 


ctgttggttg 


tctgcgtttc tctttaatta 


720 



taggccctct 


ttgaacgctc 


aaacacacag 


ggcctttgta 


agcttgaact 


ccctgtctca 


780 


cacacagtcc 


tcccataccc 


atacactctc 


tttcatttgc 


agagtataaa 


cacccatctc 


840 


tcactcattc 


acataatgaa 


tttcagctcc 


ttgtgtccca 


atcaaggaga 


ggcctcactg 


900 


gaattatggg 


catctgagcc 


atcttcatgt 


tccaaggccc 


cagggggcgc 


ttccaagagt 


960 


ggatccttta 


tggggagaag 


ataatgggca 


aaaagtgctc 


ttcactgatg 


gaccagtccc 


1020 


agccttttct 


ctccttggac 


aatagagttc 


ttcccttgaa 


cagccacttc 


cctaaaaaaa 


1080 


attccaaaat 


tctcccacat 


catccccttt 


atgcttaaaa 


tcatcacaca 


ctcccttctt 


1140 


tgtcctcccc 


tcttgcaaac 


tcaactcaga 


gccctttggc 


tccagaaaga 


ttttctaggt 


1200 


atcaggagag 


agtagcaaag 


cctccctcct 


ctccttgcct 


ttctcccttg 


tcagagaaag 


1260 


aagttgattc 


tgcggagagg 


taagaaggat 


cttgaggtct 


agagcctgaa 


aaactccttg 


1320 


ggctgttctc 


caaactagat 


gggaacataa 


ggtgcgattg 


catcttctcc 


agctgatact 


1380 


cactcggcct 


cctatgccag 


tccccagtcc 


agggtttggt 


caagggtcaa 


atgagataat 


1440 


ttcatggagg 


aagcctggcc 


cgatttttct 


actgtttgct 


ggaagacagc 


ctcttcctct 


1500 


tgtaactgca 


gccccagaac 


ctgatctcca 


catccctgcc 


aggcaggtag 


ctgtgtacaa 


1560 


gggctcatct 


tcctgccccc 


aaccccagct 


ctgatttgct 


tattcaggtg 


gtgtaaatac 


1620 


ttctaccagg 


acctatttca 


agccattgtg 


atgtccctga 


ctggggagat 


gcagggcagc 


1680 


acaccattta 


atatttccct 


cacatttcca 


ccccattctg 


cactcttttc 


tgggagttgc 


1740 


tgtctcagag 


ggttggcggt 


tctggtggct 


caagaccata 


agtaattatc 


aaatacttag 


1800 


gaagcgacgg 


gttttgagta 


tttattacct 


tttaaaaatg 


tactttgtgg 


ctaggcatgg 


1860 


tggctcacgc 


ctgtagtccc 


cgcaccggga 


ggccgaggtg 


ggtggattgc 


ttgagctcag 


1920 


gagttcaaga 


ccagcctggg 


caacacggcg 


aaacccagtc 


tctaccaaaa 


atacacacac 


1980 


acacacacac 


acacacacac 


acacacacac 


acacacaaat 


tggcctagcg 


tggtgtcgtg 


2040 


tgtctgtggt 


cgcagttact 


caggagacca 


aggtaggagg 


taggaaacca 


aggtaggagg 


2100 


atcacccgag 


gtcggtagtt 


cgagaccagc 


ctgaccaaca 


tggagaaacc 


ctgtctttac 


2160 


taaaaataca 


aaattagctg 


ggcgtggtgg 


tgcatgcctg 


taattccagc 


tacttgggag 


2220 


gctgagacag 


gagaatggct 


tgaacccgga 


aggcggagtt 


tgcggtgagc 


tgagatcgcg 


2280 


tcattgcact 


ccagcctggg 


caacaagagc 


aaaactccgt 


ctcaaaaaaa 


aaaaaatata 


2340 


tatatatgtg 


tgtgtgtgtg 


tgtgtgtgtg 


tatgtatata 


tatatatgta 


tgtgtatata 


2400 


tatgtatgtg 


tgtgtgtgca 


tatatatata 


tacactttgt 


ttaattgtaa 


gtgtgtttag 


2460 



tttaattttt aataatgtcc gtgattaaca 
agtttgcccc agcccatcca gcacaccatg 
cttgggttcc agagggctcc catgcccctg 
acgcgggggg ggtggttctt gattctgggt 
cagggatgtt gcaacagcag ggatgaggtg 
attcctagct gctgacctga tctgccctga 
ctgtccgcgg gggcagaagc gagtgaggcg 
acgcaaactg agacttcagc gccggtttct 
aatgccactc cctccccgcc aagtggctct 
tggcccctgg aggctccggc agcgagctct 
cgctgactgg gccgtccaaa gaggaggggg 
gctgcgctgc gagtggctgc ggttgcgaga 
ctgcaaatct tcgtccttgc acttgacagc 
gcttggaaag gcacaggtag gaagcgcggg 
gagtctccct cccttggctc tcctttctgg 
ttccagagtg cgggctgcac ggagaccgcg 
tcccacagcg cggcggtgcg ctgcccggcg 



gctggctggc aagattcctg agaactgaag 2520 
ggcccagggc agaccttggg gctaggcggt 2580 
tcctattgct cttctggcaa taggacattt 2640 
cttttagggg actctgtgat taagaaacag 2700 
ggcctgggga cgggtcagtg aagggtcttc 2760 
gataaaagac taagacccag agagtgaacg 2820 
tcgggacagt ggggcataac caagagcaaa 2880 
cgggccagcc cacgcctcct gcctcagctc 2940 
ccgctctgga ggcgggaccg agttctccgg 300.0 
gggaggctgg gaggggagtg aggggagggg 3060 
cctttaatag gctcgcccag cgcctggctt 3120 
agccgcccgg caccttccgc tagttctcgg 3180 
gattgtactt aagctcccag ggcgcgcttt 3240 
ctgccgggtg cacgctcgcc gccctgggag 3300 
gaactgccgg ctgtcccgta gcgttggcgg 3360 
gcagcggccg gagagcccgg cccagcccct 3420 
ccatg 3455 
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[Fig. 2] 



KD a 



CO 
I — 

CO 
CO 



o 
o 



o 

CD 
> 



200 
122 
80 

51 



Recombinant Aggrecan G1G2 



Degradation Product 



[Fig. 3] 
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1 : No Treatment 
2: IL-1(3 2 hours Treatment 
3: R.A. 2 hours Treatment 
4: IL-1p 6 hours Treatment 
5: R.A. 6 hours Treatment 
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Annex 

(1) page 10, lines 13-18 of English translation of Japanese 
Patent Application No. 2000-144020 

5 

« any of a metalloprotease which contains an amino acid 

sequence of from the 213th position to the 583rd position 
of the amino acid sequence represented by SEQ ID NO:l and 
has a protein degradation ability, or a metalloprotease 
10 which is an equivalent of the me tallopro tease . " 

(2) page 11, lines 3-9 of English translation of Japanese 
Patent Application No. 2000-144020 

15 "... an amino acid sequence of from the 213th position to 
the 950th position of the amino acid sequence represented 
by SEQ ID NO: 1 or an amino acid sequence of from the 213th 
position to the 583rd position of the amino acid sequence 
represented by SEQ ID NO:l and has a protein degradation 

20 ability, or a metalloprotease which is an equivalent of 
this metalloprotease* 

(3) from pacre 11, line 10 to page 12, line 15 of English 
translation of Japanese Patent Application No. 2000-144020 

25 

1 




" Regarding the "equivalent of the metalloprotease" , 

(1) in the case of an equivalent of the metalloprotease 
containing an amino acid sequence of from the 213th 
position to the 583rd position of the amino acid sequence 
5 represented by SEQ ID N0:1 and having a protein degradation 
ability, it is a metalloprotease in which one to several 
amino acid residues may be substituted, deleted and/or 
inserted at one to several positions in the amino acid 
sequence of from the 213th position to the 583rd position, 

10 and which has the protein degradation ability, (2) in the 
case of an equivalent of the metalloprotease containing an 
amino acid sequence of from the 1st position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
NO:l and having a protein degradation ability, it is a 

15 metalloprotease in which one to several amino acid residues 
may be substituted, deleted and/or inserted at one to 
several positions in the amino acid sequence of from the 
1st position to the 583xd position, and which has the 
protein degradation ability, or (3) in the case of an 

20 equivalent of the metalloprotease containing the amino acid 
sequence represented by SEQ ID NO:l, an amino acid sequence 
of from the 1st position to the 583xd position of the amino 
acid sequence represented by SEQ ID NO:l, an amino acid 
sequence of from the 213th position to the 950th position 

25 of the amino acid sequence represented by SEQ ID N0:1 or an 
amino acid sequence of from the 213th position to the 583rd 



position of the amino acid s quence represented by SEQ id 
N0:1 and having a protein degradation ability, it is a 
metalloprotease in which one to several amino acid residues 
may be substituted, deleted and/or inserted at one to 
5 several positions in respective sequences, and which has 
the protein degradation ability. » 

(4) page 13, line 8 to page 15, line 21 of English 
translation of Japanese Paten t Application No. 2000-l440?n 



10 



15 



[0009] 

The gene coding for the novel metalloprotease of the 
invention is any gene which contains a nucleotide sequence 
coding for the metalloprotease, namely a metalloprotease 
gene which contains a nucleotide sequence coding for an 
amino acid sequence of from the 213th position to the 583rd 
position of the amino acid sequence represented by SEQ ID 
NO :1 or an equivalent of the metalloprotease and which has 
a protein degradation ability. 
20 Also, the gene coding for the novel metal loprotein of 

the invention may be any metalloprotease gene which 
contains a nucleotide sequence coding for a metalloprotease 
represented by an amino acid sequence of from the 1st 
position to the 583rd position of the amino acid sequence 
25 shown in SEQ ID NO:l or for an equivalent of the 



3 



metalloprotease and which has a protein degradation 
ability . 

In addition, it may be any metalloprotease gene which 
has a nucleotide sequence coding for a metalloprotease 
5 represented by the amino acid sequence shown in SEQ ID 

N0:1, or by an amino acid sequence of from the 1st position 
to the 583rd position of the amino acid sequence shown in 
SEQ ID NO : 1 , an amino acid sequence of from the 213 th 
position to the 950th position of the amino acid sequence 
10 shown in SEQ ID NO:l or an amino acid sequence of from the 
213th position to the 583rd position of the amino acid 
sequence shown in SEQ ID NO:l, or for an equivalent of the 
metalloprotease, and which has a protein degradation 
ability. 

15 Regarding the "nucleotide sequence coding for an 

equivalent of the metalloprotease" , (1) in the case of a 
nucleotide sequence coding for an equivalent of the 
metalloprotease containing an amino acid sequence of from 
the 213th position to the 583rd position of the amino acid 

20 sequence represented by SEQ ID N0:1 and having a protein 

degradation ability, it is a nucleotide sequence coding for 
a metalloprotease in which one to several amino acid 
residues may be substituted, deleted and/or inserted at one 
to several positions in the amino acid sequence of from the 

25 213th position to the 583rd position, and which has the 
protein degradation ability, (2) in the case of a 



nucleotide sequence coding for an equivalent of the 
metalloprotease containing an amino acid sequence of from 
the 1st position to the 583rd position of the amino acid 
sequence represented by SEQ ID N0:1 and having a protein 
5 degradation ability, it is a nucleotide sequence coding for 
a metalloprotease in which one to several amino acid 
residues may be substituted, deleted and/or inserted at one 
to several positions in the amino acid sequence of from the 
1st position to the 583rd position, and which has the 

10 protein degradation ability, or (3) in the case of a 
nucleotide sequence coding for an equivalent of the 
metalloprotease containing the amino acid sequence 
represented by SEQ ID NO : 1 , an amino acid sequence of from 
the 1st position to the 583rd position of the amino acid 

15 sequence represented by SEQ ID N0:1, an amino acid sequence 
of from the 213th position to the 950th position of the 
amino acid sequence represented by SEQ ID NO: 1 or an amino 
acid sequence of from the 213th position to the 583rd 
position of the amino acid sequence represented by SEQ ID 

20 NO:l and having a protein degradation ability, it is a 

nucleotide sequence coding for a metalloprotease in which 
one to several amino acid residues may be substituted, 
deleted and/or inserted at one to several positions in 
respective sequences, and which has the protein degradation 

25 ability." 



_(5) page 25, line 20 to page 27 , i ine 20 of En? ligtl 
translation of Japanese Patent Application No. 2000-144n?n 

[0017] 

5 2)_ Methods for the pr oduction of the vector of the 

invention, the host cell of th Q in vention and t*» 
recombinant protein of the invention 

The thug isolated fragment containing the gene coding 
for the novel protein of the invention can be transformed 
10 into eucaryotic or proearyotic host cells by again 
integrating it into an appropriate vector DNA. In 
addition, it is possible to express the gene in respective 
host cells by introducing an appropriate promoter and a 
sequence concerned in the gene expression into these 
1 5 vectors . ..." 



6 



